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Erythropoietin Muteins With Enhanced Activity 

This invention was made with Government Support under grant nos. 
R01-HL42949 and R01-GM39900 sponsored by the National Institute of 
Health. The Government has certain rights in the invention. 

5 Cross-Reference to Related Applications 

This application is a continuation-in-part of U.S. Patent Application 
Serial No. 08/049,802 filed April 21. 1993 which is incorporated herein by 
reference. 

Background 

10 This invention relates to erythropoietin in general and more particularly 

to modified erythropoietin proteins (erythropoietin muteins) having improved 
biological activity. 

Erythropoietin (hereafter referred to as Epo) is a glycoprotein hormone 
which, in its mature form in humans, is 166 amino acids long with a 

15 molecular weight of 34 to 38 kD (Jacobs et al., Nature 31 3: 806-809 (1985). 
^ occurs in a, /3, aixl asialo forms which differ slightly in their 
carbohydrate composition and biological activity (Dordal etal., Endocrinology 
116: 2293-2299 (1985)). 

In terms of biological function, Epo has been recognized as the 

20 hematopoietic cytokine that regulates the process of red blood cell production, 
known as erythropoiesis. Erythropoiesis is a controlled physiological process 
which normally produces red blood cells in numbers which do not impede 
blood flow, but which are sufficient for oxygen transport. 

The binding of Epo to its cognate receptor (D'Andrea, A.D., et al, 

25 Cell 57:277-285 (1989)) on erythroid precursor cells in the bone marrow 
results in salvaging these cells from programmed cell death, known as 
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apoptosis (Koury, M.J., et ai, Science 248:378-381 (1990)), allowing them 
to proliferate and differentiate into circulating erythrocytes (red blood cells). 
Epo thus facilitates the formation of erythrocytes from erythroid precursor 
cells. 

5 The level of 1^ in circulation normally controls the rate of red blood 

cell production in the body. Typically, Epo is present at a low plasma 
concentration sufficient to maintain a steady state concentration of red blood 
cells by stimulating formation of just enough red blood cells to replace those 
lost through normal processes, such as aging. However, when an increase in 

10 the production of red blood cells is needed, the amount of Epo in circulation 
is increased. This red blood cell deficit may be caused, for example, by 
anemia, the general loss of blood through hemonliage, over-exposure to 
radiation, prolonged unconsciousness, or exposure to high altitudes where 
oxygen intake is reduced. 

15 In mammals, Epo is produced in the fetal liver and adult kidney, 

circulates in the bloodstream, and binds to receptors on committed progenitor 
cells in the bone marrow and other hematopoietic tissues, resulting in 
proliferation and terminal maturation of erythroid cells (Jelkmann, W., 
Physiol. Rev. 72:449 (1992)). The expression of Epo, both mRNA and 

20 protein, is markedly increased by hypoxia, owing to a 3' enhancer and to 
highly conserved elements in the promoter region (Semenza, G.L. etal., 
Proc. Natl. Acad. Sci. USA 58:5680-1684 (1988); Beck, I. etal., J. Biol. 
Chem. 266:15563-15566 (1991); Pugh. C.W. etal., Proc. Nail. Acad. Sci. 
USA 88:10553-10557; Blanchard, K.L. et al., Mol. Cell Biol. 72:5373-5385 

25 (1992)). This elegant servomechanism enables Epo to regulate the red cell 
mass of man and other animals. 

Genes encoding Epo from a number of species have been studied. 
Thus far, the Epo genes of man (Jacobs. K. et al. Nature 575:806-10 (1985); 
Lin, F.-K. etal., Proc. Natl. Acad. Sci. USA 82:7580-7584 (1985)), a 

30 monkey {Macaca fascicularis) (Lin, F.-K. et al. , Gene 44:201-209 (1986)) and 
a rodent, the mouse (McDonald. J.D. et al., Mol. 4 Cell Biol. 6:842-848 
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(1986); Shoemaker, C.B. et aL, Mol. & Cell Biol. 6:849-858 (1986)) have 
been cloned, sequenced, and expressed. There is a high degree of sequence 
homology in the coding region of the mature secreted proteins from these 
species. 

5 In particular, the gene encoding human Epo has been cloned and used 

to produce recombinant Epo in cell culture. Recombinantly produced human 
Epo has general utility as a substitute for Epo isolated from natural sources. 
In addition, recombinant Epo has facilitated the production of large amounts 
of Epo free from deleterious substances associated with naturally derived Epo, 

10 which has allowed for greater availability and utility of this protein. 

Recombinantly produced Epo has proven especially useful for the 
treatment of patients suffering from impaired red blood cell production 
(Physicians Desk Reference (PDR). 1993 edition, pp 602-605). Recombinant 
Epo has proven effective in treating anemia associated with chronic renal 

15 failure and HIV-infected individuals suffering from lowered endogenous Epo 
levels related to therapy with Zidovudine (AZT) (See PDR, 1993 edition, at 
page 602). 

Modifications of the Epo protein which would improve its utility as a 
tool for diagnosis or treaunent of blood disorders are ceruinly desirable. In 
20 particular, modified forms of Epo exhibiting enhanced biological activity 
would be more effective and efficient than native Epo in the therapy setting 
when it is necessary to administer Epo to the patient, enabling administration 
less frequently and/or at a lower dose. Administration of reduced amounts of 
Epo would also presumably reduce the risk of adverse effects associated with 
25 Epo treatment, such as hypertension, seizures, headaches, etc. (See PDR, 
1993 edition, at pp. 603-604). 

Unfortunately, available information regarding the structure of the Epo 
enzyme and its function does not permit the accurate prediction of 
modifications which would result in such improvements. In fact, the highly 
30 conserved nature of the amino acid sequence of Epo indicates that this protein 
is. for the most part, resistant to change and that most modifications would be 
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expected to have deleterious effects on Epo and its activity (See U.S. Patent 
No. 4.835.260 by Shoemaker). 

Some speculation does, however, exist regarding the possibility of 
creating variants of Epo with altered characteristics that would be preferable 

5 to the wild type protein for panicular purposes. A number of modifications 
of Epo have been prqwsed. including deletions, additions, and substitution of 
existing amino acids. These modifications have been proposed as a means of 
achieving a variety of effects, including separation of various Epo functions 
on individual protein fragments, improving the efficacy of recombinant Epo 

10 production, improving in vivo stability, etc. Of particular relevance to the 
present invention are those modifications proposed to improve the biological 
activity (i.e. ability to stimulate red blood cell production) of Epo. 

U.S. Patent No. 4,703,008 by Lin, F-K. (hereinafter referred to as 
"the "008 patent") speculates about a wide variety of modifications of Epo. 

15 including addition, deletion, and substitution analogs of Epo. The '008 patent 
does not indicate that any of the suggested modifications would increase 
biological activity per se, although it is stated that deletion of glycosylation 
sites might increase the activity of Epo produced in yeast (See the '008 patent 
at column 37, lines 25-28). Also, the '008 patent speculates that Epo analogs 

20 which have one or more tyrosine residues replaced with phenylalanine may 
exhibit an increased or decreased receptor binding affinity. 

Australian Patent Application No, AU-A-59145/90 by Fibi. M et al. 
(herein referred to as "Fibi") also discusses a number of modified Epo proteins 
(Epo muteins). Fibi generally speculates about the alteration of amino acids 

25 10-55, 70-85. and 130-166 of Epo. In particular, additions of positively 
charged basic amino acids in the carboxyl terminal region are purported to 
increase the biological activity of Epo. 

U.S. Paieni No. 4,835,260 by Siiucmakci, C. B. diiCuSScS uiOuificd 
Epo proteins with amino acid substitutions of the methionine at position 54 and 

30 asparagine at position 38. Such Epo muteins are thought to have improved 
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Stability but are not proposed to exhibit any increase in biological activity 
relative to wild type Epo. 

Beyond the few proposed modifications that have been speculated to 
enhance the biological activity of Epo, the art provides no further guidance 
5 regarding how Epo can be modified to increase its biological activity. This 
lack of guidance indicates that, based upon the present state of knowledge, 
other modifications of Epo which may increase biological activity cannot be 
predicted and may not even exist. 

The present invention represents the inventors' achievement in 
10 overcoming the lack of guidance and unpredictability regarding modifications 
of the Epo protein which increase its biological activity. 

Summary Of The Invention 

It is therefore an object of the present invention to provide Epo muteins 
with enhanced biological activity. 
15 It is a further object of the present invention to provide recombinant 

DNA encoding Epo muteins and methods of producing biologically active Epo 
muteins in a host cell culture using such recombinant DNA. 

It is a further object of the present invention to provide methods of 
treating blood disorders involving abnormally low red blood cell populations 
20 using Epo muteins at dosages lower than that required for unmodified Epo to 
achieve the same affect. 

It is a further object of the present invention to provide a method of 
obtaining additional Epo muteins with enhanced biological activity. 
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Brief Description Of The Drawings 

Figure 1. Oligonucleotide Primer Sequences. Oligonucleotide 
primers corresponding to Epo DNA sequences are shown. The localization 
of the primer coincides with previously published nucleotide sequences for the 

5 human (Lin, F.-K. et al., Proc. Natl. Acad. Sci. USA 82:7580-7584 (1985)) 
and murine (McDonald, J.D. et al.,MoL & Cell Biol. 6:842-848 (1986)) Epo 
genes. EX2R and EX5 are not completely conserved between human and 
mouse (respectively 93% and 95% identity). An equal amount of each 
possible nucleotide was incorporated during the corresponding cycles of those 

10 primer syntheses. EX2R and NCOl are reverse primers and their sequences 
represent the antisense DNA strand. 

Figure 2. PGR Strategy used for the cloning of mammalian 
cDNAs containing the complete coding sequence of the mature 
Erythropoietin Protein. Genomic amplification and sequencing of exonic 

15 fragments, localized upstream and downstream from the nucleotide ponion 
coding for the mature protein, allowed the design of species specific primers 
(SP). Utilization of those SP primers and/or of, sequences that are 100% 
conserved between man and mouse 5' ATG and 3' NCOl primers (Figure 1) 
on cDNA templates prepared from kidney of uninduced or hypoxia-induced 

20 animals, allows the amplification of a large variety of mammalian Epo clones. 
Sense (-♦) and antisense (*-) primers are represented by the arrows. Dashed 
boxes correspond to the coding part (propeptide and mature protein) of the 
five Epo exons. Gray boxes represent the 5' and 3' untranslated regions. 
Figure 3. Comparison of rVl/EX2R sequences from various 

25 mammals. The numbering corresponds to the published human genomic 
sequence (Lin, F.-K. etai, Proc. Natl. Acad. Sci. USA 52:7580-7584 
(1985)). The reported human sequence was obtained from the amplification 
of purified genomic DNA from Hep3B cells and agreed with the sequence 
previously reported. The mouse sequence is from McDonald, J.D. eiai, 

30 Mol. <t Cell Biol. 6:842-848 (1986). The horse sequence was obtained from 
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icidney-cxtracted genomic DNA. All the other sequences were established 
from PCR amplification of genomic DN As purified from several mammalian 
renal-derived cell lines. The consensus sequence indicates the positions where 
a unique nucleotide was found in all the reported species. The boundary 
between iniron 1 and exon 2 is represented by the ascending arrows. The 
locations of the two PCR pnmers. IVl and EX2R are shown by the dashed 



arrows. 



Figure 4. Alignment of the nucleotide sequences of mammalian 
Epo cDNAs. The mouse sequence corresponds to the one previously reported 
(McDonald. J.D. et al., Mol. & Cell Biol. 6:842-848 (1986)). PCR-produced 
human sequence was obtained from hypoxia-induced Hep3B cell line and is in 
total agreement with the previously published sequence (Jacobs. K. et al, 
mure 573:806-810 (1985); Lin, F.-K. et al., Proc. Natl. Acad. Sd. USA 
82:7580-7584 (1985)). Monkey (Macaco mulatta), rat. sheep, pig and cat 
were amplified from kidney-purified cDNAs. Amplifications of the human, 
monkey, rat and sheep were realized using the ATG (residues 1 to 20) and 
NCOl (residues 723-742) primers. The line under residues 41 to 56 
represents the specific SPl primer used for the cloning of the pig and cat 
sequences. For these two mammals. 5' sequences for SPl are derived from 
the data presented in Figure 3. Arrows indicate the start and the end of the 
coding sequence (propeptide and mature protein). 

Figure 5. Predicted Amino Acid Sequences of Epo Propeptide. 
The amino acid sequences are derived from data obtained from both genomic 
IV1/EX2R and cDNA amplifications. The numbering corresponds to the 
human sequence. Ala 1 is the first amino acid of the human mature protein. 
The ascending arrows show the site of the cleavage by the signal peptidase, 
as determined for the human and cynomolgus monkey proteins. 

Figure 6. AUgnment of the Primary Structures of mature 
mammalian Epo Proteins. The human, Macaca fascicularis and mouse 
) amino acid sequences were previously reponed (Lin, F.-K. et al. , Proc. Nail. 
Acad. Sci. USA 82:7580-7584 (1985); Lin. F.-K. et al.. Gene 44:201-209 
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(1986); McDonald. J.D. et at., Mol. & Cell Biol. 6:842-848 (1986)). The 
residue numbers correspond to the 166 aa of the mature human proteir. Plain 
boxes indicate the positions of the predicted four «-helices in the human 
sequence (See Example II). Dashed boxes show the N- and 0-glycosylation 
sites. Limits between exons are indicated by the vertical lines. 

Figure 6A. Schematic represenUtion of the mammal 
erythropoietin. The invariant amino acids among the eight sequences shown 
in Figure 6 are represented by the black boxes. The localization of each of 
the four a-helices is underlined. The three glycosylation sites are shown as 
diamonds. The main disulfide bridge is indicated by the heavy line connecting 
noncontiguous sequences. The small arrows under the sequence indicate the 
short SH-bridge. missing in the rodents. 

FiBuns 7-7B. Phylogenetic lineages derived from analyses of 
mammalian Epo cDNA sequences encodmg 
the full length mature prote'ms from seven 



Figure 7. Strength of groups in the maximum parsunony tree 
found on examining all 945 unrooted trees formed by seven terminal taxa. 

This tree of lowest length required 374 base substitutions. Each link between 
ancestral nodes has a circled number; this strength of grouping number is the 
minimum number of substitutions that must be added to the length of the 
maximum parsimony tree to find a tree that breaks down the barrier (moves 
one or more sequences) between the two groups separated by the interior link. 

F^ure 7A. The phylogenetic tree derived from the maximum 
paramony reconstruction. On the basis of other molecular evidence 
involving comparative amino acid sequence data from monotremes, 
marsupials, and many eutherian species (Czelusniak, J. et al.. Meth. Enzymol. 
785:601-615 (1990); Czelusniak. J. etal.. In "Currem Mammology", 
Volume 2. 3rd ed.. H. H. Ganoways. pp. 545-572. Plenum Publ. Corp. 
(1990)) the root of this Epo phylogenetic tree is placed on the interior link 
separating the artiodactyl group from the primate, rodent, cat group. The 
numbers on the inicmodal links represent the numbers of base pairs by which 
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*e noda, ances.,.1 and descndam se,u.„ces m... MYA = ^il-s of 
v«.s ago as inferrol from paleomological v«ws of cuU,=ria. pt.ylog.ny. 

«j„«7B. Parsimony racooaruaion onlhat portion .f 
cDNA s«,uences to. .re codons for <»nino acids. The numto shown as 
a f«uon on each link are. in me nun,era«>r, *e number of amino acd 
changmg i=ase n=piacemen.s and. in *e denominator, d^e number of silcm base 
,ep,acemenu. The eompu«r ^gori*m to, carried ou. rhis calcuiauon .s 
described in C^lusniak, >.e,ol.. Ncu. 29M97-300 (1982) (see U,e legend 
for Figure 2). 

Fizum S-SA. PhylogeneticlineagesderivedframanalysK 
tigureso niammalian Epo intron 1-cxon 2 

sequences. 

Figure 8. Strength of groups in the maximum parsmiony tree 
found on examining all 135,135 trees forced by nme tenninal taxa. As 

, the two feloid sequences (those of cat and lion) are nearly identical , they were 
seated as a single taxon in this strength of grouping analysis. The maximum 
parsimony tree for the segment (about285bp)ofintronlandexon2from ten 

species required 361 base substitutions. The circled numbers are the strength 
of grouping numbers (as defined in the Figure 7 legend). 
0 Figure 8A. The near most parsimonious tree that groups dog 

withfeloids. This tree required 365 base substitutions. The numbers on links 
represent numbers of base pairs by which the nodal ancestral and descendant 
sequences differ. 

Figures 9-9B. Model of the Thrte-Dimensional Structure of 
2^ Erythropoietin. 

Figure 9. Ribbon diagram of Epo tertiary structure. The four a 
helices are labeled A to D; loops between helices are appropriately named. 
Disulfide bndges are shown and N- ano 0-giycosyiation sites are ind.ca.u 
respectively by dark and dotted segments. 
30 Figure 9A. Schematic representation of Epo's primary structure 

depicting predicted up-up-down-down onentation of the four anuparallel a 
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helices (boxes with arrowhead). This folding pattern is strongly suggested by 
the large size of the two interconnecting loops AB and CD. The limits of each 
helix were drawn according to Table II of Example 1. A predicted shon 
region of 0-sheet is delineated by the dashed rectangle. The N-glycosylation 
5 sites are represented by the dotted diamonds, the O-glycosylation site by the 
dashed oval. The locations of the two disulfide bridges are shown as solid 
lines. 

Figure 9B. Cross-section of the Epo molecule at the level of the 
four a helices. The helical wheel projections are viewed from the NH,-€nd 

10 of each helix. The hydrophobic residues, localized inside the globular 
structure, arc indicated by filled circles. The charged and neutral residues 
(open and gray circles respectively) are exposed at the surface of the molecule. 

Figure 10. Immunopredpitations of wild type Epo and the 
A140-144 mutein. Cos? cells were transfected with pSG5, pSG5-Epo/wt or 

15 pSG5-Epo/A140-144. After three days, the cells were metabolically labeled 
with ["S]-methionine and ["S]-cysteine. Immunoprecipitations of cellular 
extracts and supematanu were performed with our polyclonal antibody, raised 
in rabbit against the native human Epo. The immunoprecipitates were 
analyzed by SDS-polyacrylamide gel electrophoresis. Lanes 2 to 4 correspond 

20 to cellular extracts, and lanes 5 to 7 correspond to culture supematants, from 
Cos7 transformed with the following: plasmid without insert (lanes 2 and 5), 
wild type Epo (lanes 3 and 6), and A140-144 (lanes 4 and 7). Lane 1 
represents the protein molecular weight standard. The two arrows show the 
normal secretion of the wild type Epo (35-37 kD) and the cytoplasmic 

25 retention of the mutein A140 = 144 (-28 kD). 
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Figures ll-llB. Interconnecting Loop AB. 
Figure 77. Schematic represenution of the loop AB showing the 
localization of muteins with various deletions and amino acid 
replacements. The dashed arrows point to the posiuons of the serine 
substitutions (in A48-52). The two N-giycosylation sites are represented by 
the gray diamonds. The small Cys29=Cys33 disulfide bridge is indicated. 

Figure llA. Amount and biological activities of secreted muteins. 
The upper bar graphs show the relative secretion of wild type and loop AB 
muteins as determined by radioimmunoassay. The lowest bar graphs display 
the calculated specific activity (ratio bioassay/RlA) for each mutein. in 
comparison with the value obtained for the wild type Epo (ratio = 100%). 

Figure I IB. HCD57 ceU proliferation as a function of increasing 
concentration of wUd type and serine-substituted Epo muteins. HCD57 
cells (10* per ml) were cultured for three days in a % well microliter plate 
with media containing increasing concentrations of secreted proteins 
(mU =milli-units of Epo, a relative mass measurement). The line graphs show 
the cellular growth as measured by 'H-thymidinc uptake for cells cultured with 
wild type Epo (O), Epo mutein F48S (O). Epo mutein Y49S (X). Epo mutein 
A50S ( ♦ ), Epo mutein W51S (□), and Epo mutein K52S (a). The number 
of viable cells was also measured with the MTT colorimetric assay and gave 
similar curves. Proliferation experiments using the human UT-7/Epo cell line 
(Komatsu. N.. et al, Cancer Res. 57:341-348 (1991)) and the Krystal assay 
(Krystal, G., Exp. Hmatol. 11:6^9-60 (1983)) produced identical results. 
Figures 12-12A. Interconnecting Loop CD. 
1 Figure 12. Schematic representation of the loop CD showing the 

location of three deletion mutems: A105-109, All 1-119, A122-126, and the 
insertion of seven residues after Lysll6 (myc epitope). The 0-glycosylation 
site is indicated by the dashed oval. 

Figure 12A. Secretion and biological activities of the muteins 
0 located in loop CD. The two bar graphs were created as described in 



wo 94/2416(1 



PCTA3S94/04361 



- 12 - 

Figure 1 1 A. The two muunts Al 11-119 and 1 16/myc were normally secreted 
and had full biological activities. 

Figures 13-13A. In Vitro Translation of the Epo WUd Type. 

Figure 13. Analysis of the '•'S-labeled translation products by 
5 SDS-PAGE. One-step transcription/translation reactions were performed in 
the SP6-TnT rabbit reticulocyte lysate system. 1/30 of each reaction was 
resolved on a 15% polyacrylamide gel. Lane Mow Mr standard from 
Amersham; lane 2-in vitro reaction without added plasmid; lanes 3 and 4 are 
translation products obtained after incubation of 1 of circular p64T-Epo, 
10 in the presence or absence of canine pancreatic microsomal membranes, 
respectively. 

Figure 13A. Binding of the in vitro translated Epo wild type onto 
Epo Receptor-GTS-agarose beads. 6xl(P cpm (counts per minute) of 
purified ^'S-labeled erythropoietin products, processed with microsomes (+) 
15 or not (-) were incubated in the presence of the extra-cytoplasmic domain of 
the Epo receptor (ERE.), following the protocol described by Harris, K.W., 
etal., J. Biol. Chem. 267:15205-15209 (1992). Identical binding 
demonstrated that the conservation of the propeptide did not impair the 
hormone/receptor interaction. 
20 Figures 14-14A. COOH-end of Epo. 

Figure 14. Sdiematic representation of the analyzed muteins, 
corresponding to the deletion of the four last amino acids A163-166 and the 
replacements of the residues 162 to 166 by a KDEL or poly (His) sequences. 
Figure 14 A. Relative secretion of these muteins. The bioactivities 
25 in the supematants (Krystal, G., Exp. Hematol. ll:m-60 (1983)) and the cell 
extracts (Komatsu, N., et al. Cancer Res. 57:341-348 (1991)) of transformed 
Cos? cells were measured by in vitro proliferation assay using HCD57. More 
KDEL mutant remained in the cytosol of the Cos7, when compared with the 
wild type Epo and A163-166 or poly (His) muteins. However, all the 
30 analyzed muteins had the same specific activity as that of the wild type. 



wo 94/24160 



PCTajS94/04361 



- 13 - 

Figure 15. Bacterial Expression of Wild Type Epo. 

Figure 15 Panel A. Diagram of the fusion protein. An NH- 

terminal 22 amino acid long peptide, containing a 10 histidine stretch, was 
fused to tlie mature erythropoietin sequence. Factor Xa cleavage allowed the 
5 recovery of the mature Epo with only two extra residues at its amino terminus. 

Figure 15 Panel B. IPTG inducUcn of the fusion protein. 
Transformed E. coli BL21 (DE3) cultures (OD«o=0.6) were grown in 
presence of 1 mM IPTG. Aliquots were collected at 0 (lane 2), 1 (lane 3), 2 
(lane 4) and 3 hours (lane 5) and analyzed on a 15% SDS-polyacrylamide gel, 
10 stained with Coomassie Brilliant Blue. High level production of the fusion 
protein was rapidly obtained. Lane 6 corresponds to an aliquot from 
transformed bacteria grown for 3 hours in a medium without IPTG. Lane 1 
is a low molecular weight standard. 

Figure 15 Panel C. PuriTication of the fusion protein. After 3 
15 hours of IPTG induction, the produced (His),o-Epo was solubilized in 6 M 
guanidine-HCl and purified on a nickel affinity resin by increasing imidazole 
concentrations following the pET-His system protocol (Novagen). Samples of 
the column eluants were analyzed by SDS-PAGE and stained with Coomassie 
Brilliant Blue. Lane 1-elution by 20 mM imidazole; lane 2-elution by 100 
20 mM imidazole, releasing the fusion protein; lane 3-chelation of the nickel by 
a 100 mM EDTA wash; lane 4-molecular weight standard. 

Figure 15 Panel D. Detection of the E. coli recombinant Epo on a 
Western blot. Solubilized proteins were separated on a 15% SDS- 
polyacrylamide gel, transferred to a nitrocellulose membrane and probed with 
25 a 1/2000 dilution of our native wild type polyclonal antibody, as described 
under Materials and Methods. Lane 1 -analysis after oxidative reduction; 
lane 2-after dialysis against the factor Xa buffer; and lane 3-after factor Xa 
cleavage. 

Figure 16. Relationship Between Production of Muteins and 
30 Proposed Secondary Structure. This bar graph shows the amount of 
secreted proteins in the supematants of transiently expressed Epo mutants, as 
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detected by radioimmunoassay. The muteins were aligned over a schematic 
representation of the native Epo molecule. Each deletion is shown as a 
stippled bar, the width of which is proponional to the number of residues 
deleted. The four a helices are represented by the black rectangles. The two 
5 disulfide bridges are indicated. These mutagenesis results are in good 
agreement with our proposed four a-helical model of Epo. 

Figure 17. Biological Activity of Helbc A Muteins. The top panel 
shows a helical wheel projection of Epo helix A with arrows indicating the 
positions where modifications were made to create muteins. The lower panel 
10 is a dose response curve representing the biological activity of wild type Epo 
(□) and helix A muteins E13A (♦), R14A (D). E18A (0), K20A and 
E21 A (□) over a range of dosages (X-axis; mU =mil!i-units of Epo, a relative 
mass measurement) as determined by the relative incorporation of pH]- 
thymidine measured in counts per minute (cpm) (Y-axis) into Epo dependent 
15 HCD57 cells. 

Figure 18. Biological Activity of Helix D Muteins. The top panel 
shows a helical wheel projection of Epo helix D with arrows indicating the 
positions where modifications were made to create muteins. The lower panel 
is a dose response curve representing the biological activity over a range of 
20 dosages (X-axis) of wild type Epo (id) and helix D muteins D136A (♦), 
R139A (D). K140A (O) and 143A (x) as determined by the relative 
incorporation of pH]-thymidine (Y-axis) into Epo dependent HCD57 cells. 

Figure 19. Biological Activity of Muteins Adjacent to Helix D. 
The top panel shows the region adjacent to the C-terminal boundary of Epo 
25 helix D with arrows indicating the positions where modifications were made 
to create muteins. The lower panel is a dose response curve representing the 
biological activity over a range of dosages (X-axis) of wild type Epo (b) and 
muteins adjacem to helix D K152A (♦). K154A (U). and Y156A (O) as 
determined by the relative incorporation of ['H]-tiiymidine (Y-axis) into Epo 
30 dependent HCD57 cells. 
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Figure 20. Comparison of Mutcins With Increased Biological 
Activity. A dose response curve representing the biological activity of wild 
type Epo (-*-), and mutcins R143A (□) and K154A (♦) with increased 
biological activity relative to wild type Epo is shown over a range of dosages 
5 (X-axis) as determined by the relative incorporation of pH]-thymidine (Y-axis) 
into Epo dependent HCD57 cells, n. 

Figure 21. Comparison of Muteins Whh Increased Biological 
Activity in Non-malignant Cells. A dose re^nse curve representing the 
biological activity of wild type Epo (-)e), and muteins R143A (□) and K154A 

10 ( ♦ ) with increased biological activity relative to wild type Epo, is shown over 
a range of dosages (X-axis) as determined by the relative incorporation of 
[^H]-thymidine (Y-axis) into murine spleen cells. 

Figure 22. Comparison of Muteins With Increased Biological 
Activity in a Human Cell Line. A dose re^onse curve representing the 

15 biological activity of wild type Epo (x), and muteins R143A (□), and K154A 
( ♦ ) with increased biological activity (relative to wild type Epo) is shown 
over a range of dosages (X-axis) as determined by the relative incorporation 
of ['H)-thymidine (Y-axis) into human Epo-dependent UT-7/Epo cell line 
(Komatsu, N. et at., Cancer Res. 51: 341-348 (1991). 

20 Figure 23. Comparison of Muteins with Increased Biological 

Activity in a Human Cell Line. A dose response curve representing the 
biological activity of wild-type Epo (□) and mutein N147A ( ♦ ) with increased 
biological activity (relative to wild-type Epo) is shown over a range of dosages 
(mU) (X-axis) as determined by the relative incorporation of pH]-thymidine 

25 (Y-axis) into human Epo-dependent UT-7/Epo cell line. 

Figure 24. Comparison of Muteins with Increased Biological 
Activity in a Human Cell Line. A dose response curve representing the 
biological activity of wild-type Epo (□) and mutein NI47A ( ♦ ) with increased 
biological activity (relative to wild-type Epo) is shown over a range of dosages 
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(mU) (X-axis) as determined by the relative incorporation of ['H]-thymidine 
(Y-axis) into human Epo-dependent UT-7/Epo cell line. 

Figure 25. Comparison of Muteins with Increased Biological 
Activity in a Human CeU Line. A dose response curve representing the 
5 biological activity of wild-type Epo (a) and mutein N147A (♦.«!) with 
increased biological activity (relative to wild-type Epo) is shown over a range 
of dosages (mU) (X-axis) as determined by the relative incorporation of ['Hj- 
thymidinc (Y-axis) into human Epo-dependent UT-7/Epo cell line. 
Biological Activities of Helix A Muteins. 
10 Figure 26. Comparison of Muteins with Increased Biological 

Activity in a Human Cell Line. A dose response curve representing the 
biological activity of wild-type Epo (□) and mutein K20A with increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 
15 (Y-axis) into UT-7 Epo cells. 

Figure 27. Comparison of Muteins with Increased Biological 
Activity in a Human Cell Line. A dose response curve representing the 
biological activity of wild-type Epo (n) and mutein Y49S with increased 
biological activity relative to the wild type Epo, is shown over a range of 
20 dosages (X-axis) as determined by the relative incorporation of ['H]-thymidine 
(Y-axis) into UT-7 Epo cells. 
Biological Activities of Helix B Muteins. 

Figure 28. Comparison of Muteins with Increased Biological 
Activity in a Human CeU Line. A dose response curve representing the 
25 biological activity of wild-type Epo (13) and mutein A73G with increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of [^H]-thymidine 
(Y-axis) into UT-7 Epo cells. 
Biological Activities of Helix D Muteins. 
30 Figure 29. Comparison of Muteins with Increased Biological 

Activity in a Human Cell Line. A dose response curve representing the 
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biological activity of wild-type Epo (Q) and mutein K140A with increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of ['H]-thymidine 
(Y-axis) into UT-7 Epo cells. 
5 Figure 30a. Comparison of Muteins with Increased Biological 

Activity in a Human Cell Line. A dose re^nse curve representing the 
biological activity of wild-type Epo (b) and mutein R143A with increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 
10 (Y-axis) into UT-7 Epo cells. 

Figure 30b. Comparison of Muteins with Increased Biological 
Activity in a Mouse Cell Line. A dose response curve representing the 
biological activity of wild-type Epo (o) and mutein R143A with increased 
biological activity relative to the wild type Epo, is shown over a range of 
15 dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 
(Y-axis) into HCD 57 cells. 

Figure 31. Comparison of Muteins with Increased Biological 
Activity in a Human Cell line. A dose response curve representing the 
biological activity of wild-type Epo (□) and mutein S146A with increased 
20 biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 
(Y-axis) into UT-7 Epo cells. 

Figure 32. Comparison of Muteins with Increased Biological 
Activity in a Human Cell Line. A dose response curve representing the 
25 biological activity of wild-type Epo (0) and mutein N147A with increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 
(Y-axis) into UT-7 Epo cells. 

Figure 33a. Comparison of Muteins with Increased Biological 
30 Activity in a Human Cell line. A dose response curve representing the 
biological activity of wild-type Epo (Q) and mutein K154A with increased 
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biological activity relative to the wild type Epo. is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of ['H]-thymidine 
(Y-axis) into UT-7 Epo cells. 

Figure 33b. Comparison of Muteins with Increased Biological 
Activity in a Mouse Cell Line, A dose response curve representing the 
biological activity of wild-type Epo (b) and mutcin K154A with increased 
biological activity relative to the wild type Epo. is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of ['H]-thymidine 
(Y-axis) into HCD 57 cells. 

Figure 33c. Comparison of Muteins with Increased Biological 
Activity in Non-Malignant Mouse Cells. A dose response curve representing 
the biological acdvity of wild-type Epo (E) and mutein K154A with increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by die relative incorporation of pH]-thymidine 
(Y-axis) into primary murine spleen cells. 

Figure 34. Bioactivity of Epo Replacement Muteins. The letters in 
die column on the left designate predicted helices (A, B. C, D) or interhelical 
loops (A-B. C-D). The 3'D designation (3' of D) in die table represents 
amino acids 153-166 which are at the carboxy-end of die D-hclix. The amino 
acid of wild type human Epo, its position from die N-tcrminus, and its 
replacement according to die single amino acid letter code, are listed under die 
mutein column, e.g. S9A. represents a mutation from die wild-type human 
Epo at die amino acid serine of position number 9 to alanine. The diree 
bioassay columns (human UT7 cell line, murine HCD57 cell line, and primary 
mouse spleen erydiroid cells) show specific bioactivity of each mutein, 
expressed as a percentage of wild type human Epo bioactivity, widi die 
background COS supernatant alone subtracted from die value. The mean and 
standard deviation of die bioassay values are listed for nS:3 determinations. 
The numbers widiin die brackets indicate die number of separate COS cell 
) tiansfections over (/) die number of separate bioassays diat were performed. 
NS: not secreted from COS 7 cells. 
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DefinUions 

In order to provide a clearer and more consistent understanding of the 
specification and claims, including the scope to be given such terms, the 
following definitions are provided. 

Administraticn. The term "administrauon' is meant to include 
introducuon of the Epo muteins of the invention into an animal or human by 
any appropriate means knov^n to the medical or veterinary an, including, but 
not limited to, injection, oral, enteral, and parenteral (e.g.. intravenous) 
administration. 

Amino Acid Codes. The most common amino acids and their codes are 
described in the following table: 
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Table 1 




Amino acid names and codes 


Amino acid 


Single letter 
code 


Three letter 
code 


Alanine 


A 


Ala 


Argininc 


R 


Arg 


Aspartic acid 


D 


Asp 


Asparagine 


N 


Asn 


Cysteine 


C 


Cys 


Glutamic acid 


E 


Glu 


Glutamine 


Q 


Gin 


Glycine 


G 


Gly 


Histidine 


H 


His 


Isolcucine 


1 


He 


Leucine 


L 


Uu 


Lysine 


K 


Lys 


Methionine 


M 


Met 


Phenylalanine 


F 


Phe 


Proline 


P 


Pro 


Serine 


S 


Ser 


Threonine 


T 


Thr 


Tryptophane 


W 


Trp 


Tyrosine 


Y 


Tyr 


Valine 


1 V 


Val 



Animal. The term "animal" is meant to include all animals whose 
25 production of red blood cells (erythrocytes) is dependent upon, or stimulated 
by, erythropoietin (Epo). Foremost among such animals are humans; 
however, the invention is not intended to be so limiting, it being within the 
contemplation of the present invention to treat any and all animals which may 
experience the beneficial effect of the Epo muteins of the invention. 
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Efficacious Amount. An "efficacious amount" of an Epo mutein of the 
invention is an amount of such Epo mutein that is sufficient to bring about a 
desired result, particularly the stimulation of red blood cell production, 
especially upon administration to an animal or human. 
5 Erythrocyte. The term "erythrocyte" is intended to refer to a red blood 

cell. 

Erythroid Precunor Cells. By "erythroid precursor cells" is intended 
ceils with a capability to proliferate and differentiate into erythrocytes that is 
dependent upon exposure or contact with Epo. 

10 Erythrvpoietin (Epo). The term "erythropoietin", or "Epo" is 

primarily intended to refer to the mature form of this hormone. The terms 
"wild type Epo" and "native Epo" are used interchangeably to refer to Epo in 
its naturally occurring, unmodified form. Unless otherwise indicated, amino 
acid positions on the Epo protein referred to herein are made with reference 

15 to the mature, 166 amino acid human Epo sequence shown in Figure 6 and 
corresponding sequences from other species. 

Host Cell. By "host" or "host cell" is intended the cell in which a gene 
encoding an Epo mutein of the invention is incorporated and expressed. An 
Epo mutein gene of the invention may be introduced into a host cell as part 

20 of a vector by transformation. 

Mutein. By "mutein" is meant a mutant or modified protein with one 
or more modifications to its native amino acid sequence in the form of amino 
acid additions, deletions, or substitutions. An erythropoietin mutein refers to 
a modified erythropoietin protein. 

25 Pharmaceutically Acceptable Vehicle. The term "pharmaceutically 

acceptable vehicle" is intended to include solvents, carriers, diluents, and the 
like, which are utilized as additives to preparations of the Epo muteins of the 
invention so as to provide a carrier or adjuvant for the administration of such 
Epo muteins. 

30 Tmnsfomation. By "transformation" is intended the act of causing a 

host cell to contain a desired nucleic acid molecule, including either a native 
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Epo gene or a gene encoding an Epo mutein of the invention, not originally 
part of that cell using methods Icnown in the an. 

Transfection. By "transfection" is intended the introduction of a DNA 
or RNA vector carrying a desired nucleic acid molecule, including either a 
5 native Epo gene or a gene encoding an Epo mutein of the invention, into a 
host cell. 

Vector. By "vector" is intended a DNA element used as a vehicle for 
cloning or expressing a desired gene, such as an Epo mutein gene of the 
invention, in a host. 

1 0 Detailed Description of the Preferred Embodiments 

This invention is drawn to modifications of Epo which enhance its 
biological activity. Due to their enhanced activity, the Epo muteins of the 
invention provide a preferable alternative to native Epo where it is used to 
stimulate red blood cell production (e.g. treatment of blood disorders, cell 
15 culture, etc.). 

The present invention teaches a structural model of the Epo protein 
useful for identifying functional regions which may be modified to enhance 
biological activity. According to the invention, Epo is predicted to have a 
four anu-parallel amphipathic a-helical bundle structure. Based upon this 
20 predicted structure, functionally important regions predicted to reside on the 
external surfaces of the helices (designated A-D, see Figures 9A-9B) are 
identified. In particular, the external surfaces of helices A and D, predicted 
to be involved in Epo receptor binding, are identified. According to the 
invention, these functionally important regions serve as targets for 
25 modifications which may enhance biological activity. Increased activity 
relative to the wild-type Epo can be found with amino acid substitutions in the 
A,B,C or D helix. 

According to one aspect of the invention, Epo muteins are provided in 
which one or more of the amino acids within the predicted external surfaces 
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of helices A, B, C. D and 3' of D, or within the region immediately adjacent 
to the C-teminal end of helix D, have been replaced. In particular, amino 
acid substitutions at positions 20, 49, 73, 140, 143, 146, 147 and 154 of wild 
type Epo have been replaced. The inventors have discovered that substitution 

5 of the amino acids normally occurring at these positions results in Epo muteins 
with significantly higher biological activity than wild type Epo proteins. 

According to the invention, the amino acids at position 20 (typically 
alanine), 49 (typically serine). 73 (typically glycine), 140 (typically lysine), 
position 143 (typically arginine), position 146 (typically serine), position 147 

10 (typically asparagine), and 154 (typically either lysine or threonine) of wild 
type Epo are preferably replaced with an alanine residue. In addition to 
alanine other amino acids with a chemical structure and properties similar to 
alanine, such as serine and threonine, are contemplated by the invention as 
suitable substitutes for achieving Epo muteins of the invention with enhanced 

15 biological activity. 

In another aspect of the invention, a jxjrtion of the predicted interloop 
region between helices A and B extending from amino acid 48-52 is identified 
as important to Epo function. According to the invention, this functionally 
important region serves as another target for modifications which may enhance 

20 biological activity. The inventors have discovered that substitution of the 
amino acid at position 49 results in Epo muteins with significantly higher 
biological activity than wild type Epo proteins, 

According to the invention, the amino acid at position 49 (typically 
tyrosine) of wild type Epo is preferably replaced with a serine residue. Other 

25 amino acids with a chemical structure and properties similar to serine, such 
as alanine and threonine, are contemplated as suitable substitutes for achieving 
Epo muteins of the invention with enhanced biological activity. 

The modifications taught by the present invention occur in regions thai 
are highly conserved among Epo proteins from evolutionarily divergent species 

30 (see Example I and Figure 6 in particular). These modifications are thus 
expected to be broadly applicable to Epo proteins which are substantially 
similar to the human Epo protein in the regions where the modifications occur. 
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including, but not limited to, Epo proteins from monkey, mouse, rat, sheep, 
pig, cat, and dog. 

Biological activity of Epo muteins can be determined by assaying the 
ability of these proteins to stimulate the proliferation of Epo responsive cells 

5 such as murine spleen cells (Krystal, G., Exp. Hemaiol. ii:649-60 (1983); 
Goldberg. M.A., et al., Proc. Natl. Acad. Sci. USA 5^:7972-7976 (1987)), 
murine Epo responsive murine erythroleukemia cells (Hankins, W.D., et al., 
Blood 70:173a (1987)), and human Epo-dependent UT-7/Epo cells (Komatsu, 
N., etal.. Cancer Res. 57:341-348 (1991)). 

10 When assaying for biological activity, it is important to recognize that, 

as taught by the present invention, high levels of biologically active Epo can 
cause terminal maturation and cessation of division of Epo responsive cells. 
Because of this possibly confounding phenomenon, it is important to determine 
biological activity based on a full dose response curve over a broad range of 

15 dosages which includes non-saturating levels of the particular Epo mutein 
being assayed. 

Although the mechanism by which the modifications taught by the 
invention increase biological activity is not completely understood, one 
possibility is that these modifications enhance receptor binding affinity. This 
20 possibility is thought to apply particularly to the modifications at positions 20, 
49. 73, 140. 143, 146, 147, and 154, which lie within a region predicted to 
be involved in Epo receptor binding according to the teachings of the present 
invention. 

Genes Encoding Epo muteins (Epo mutein genes) 

25 In addition to the Epo muteins themselves, genes encoding these 

muteins are also contemplated by the present invenuon. (Jenes encoding the 
Epo muteins of the invention may be made by modification of the native Epo 
gene using standard methods well known to those of skill in the art. 

The native Epo gene may be obtained using a variety of standard 

30 techniques. For example, a DNA molecule corresponding to the known 
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sequence of the Epo gene from a number of species may be artificially 
synthesized (See Example 1 and Figure 4). The Epo gene may also be isolated 
from a cDNA or genomic library using nucleic acid probes based on available 
Epo DNA sequence information and standard hybridization techniques as 
5 taught in U.S. Pat. No. 4,703.008, issued Oct. 27, 1987. which is herein 
incorporated by reference (See also Jacobs et al.. Nature 313: 806-810 
(1985)). Alternatively, amplification of Epo gene sequences by polymerase 
chain reaction (PCR) may be accomplished using primers based on available 
sequence data as taught in Example I. 

10 Once obtained, the wild type Epo coding sequence can be altered to 

encode an Epo mutein of the invention using standard oligonucleotide-directed 
in vitro mutagenesis techniques (See Zoller. M. and Smith. M., Methods in 
Enzymology 100: 468-500 (1983); Dente, L. etal.. Nucl. Adds Res. 11: 1645 
(1983); Kunkel, T.A., et al. Meth. Enzymol. 7J¥:367-382 (1987); Vandeyar, 

15 M. et al., Gene 65: 129 (1988)). These techniques generally involve 
hybridization of an oligonucleotide carrying the desired alteration to a single 
stranded target DNA. This is followed by complementary strand synthesis of 
the target DNA which incorporates the oligonucleotide carrying the desired 
alteration. The complementary strand containing the desired alteration is then 

20 propagated in a bacterial host cell. Specific application of this type of 
technique to modify the Epo gene is described in Examples II and III and in 
Australian Patent Publication No. AU-A-59145/90 by Fibi et al. and in U.S. 
Pat. No. 4,835,260 by Shoemaker, both of which are herein incorporated by 
reference. 

25 Expression of Epo mutein Genes 

Genes encoding Epo mutcins of the invention can be introduced and 
expressed in a selected host cell system using conventional materials and 
techniques. DNA elements such as promoters, enhancers, polyadenylation 
sites, transcription termination signals, and the like should be associated with 



wo 94/24 160 



PCT/US94/04361 



the Epo mutein coding sequence so as to promote and control expression of 
the Epo mutein. The specific regulatory elements used will depend upon the 
host cell system selected for expression, whether secretion of the protein is 
desired, and other considerations readily apparent to one of skill in the an. 

5 Various vectors may be employed as vehicles for the introduction and 

expression of Epo mutein genes in a host cell. Such vectors useful in the 
different host cell types are well known and include, for example, the 
mammalian expression vectors pSG5 (Stratagene), p-RKl (Genetics Institute), 
P-SVK3 (Pharmacia), p-EUK-Cl (Clontech), pCDM (Invitrogen), pc DNAl 

10 (Invitrogen), and the bacterial expression vectors pFLAG-1 (IBI), all pET 
system plasmids (Novagen), pTrcHis (Invitrogen), the pGEX scries 
(Pharmacia), and pKK 233-2 (Clontech). These vectors may be maintained 
as episomes in the host cell or they may facilitate integration of the Epo 
mutein gene into the host cell genome, or both. Vectors may also include 

15 other useful features, such as genes which allow for the selection or detection 
of cells in which they have been successfully introduced. 

Host cells suitable for expression of the Epo muteins of the invention 
include, but are not necessarily limited to. E. coli, yeast, insect cells, plant 
cells, and a variety of mammalian cell types, including in particular Chinese 

20 Hamster Ovary (CHO) cells, Cos7 cells, Cosl cells, baby hamster kidney 
cells, and CVl cells. Epo mutein genes can be introduced into a host cell 
using standard transformation or transfection techniques. 

Host cells which provide for glycosylation are preferred for production 
of Epo muteins to be used in vivo since the carbohydrate structure of Epo is 

25 important for optimal biological activity in vivo {See Dordal, M.S. et al., 
Endocrinology 116(6): 2293-2299 (1985)). 

The Epo muteins of the invention may be recovered and purified from 
hn» in which fhsy are p.itnrpw/1 iicing known iripthods SUCh as 

immunoaffmity chromatography with antibodies to human Epo. 
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Therapeutic Use of Epo muteins 
of the Invention 

The Epo muteins of the invention are suitable for therapeutic use in 
animals, particularly humans, and may be used in the same manner as wild 
5 type Epo, except that lower dosages will be required to achieve the same level 
of biological activity. 

Administration of the Epo muteins of the invention may be 
accomplished by any of the methods known to the skilled artisan, provided 
that the method effectively places the Epo mutein in the appropriate 

10 environment to exhibit its activity (i.e. in contact with erythroid precursor 
cells). A preferred method is by parenteral routes, including intravenous and 
subcutaneous administration. 

Administration will ordinarily include an efficacious amount of the Epo 
mutein supplied in a pharmaceutically acceptable vehicle. The amount of Epo 

15 mutein which is efficacious will be determined by a trained professional 
depending upon a number of considerations, including the condition being 
treated and its severity, the sex and body weight of the subject being treated, 
the method of administration, and the presence of compositions in the 
pharmaceutically acceptable vehicle which affea Epo activity. 

20 In any event, the efficacious amount of a Epo mutein as taught by the 

invention will be significantly less than the corre^nding amount of 
unmodified Epo needed to thieve the same degree of efficacy. The 
difference between the efficacious amount of the Epo mutein and the 
analogous amount of unmodified Epo will correspond with the increased 

25 biological activity exhibited by the Epo mutein. It is contemplated that an 
efficacious amount of an Epo mutein of the invention will be as low as 10-fold 
lesE than nativs Epo, or as low as about 5-lQ units'kg body v.'eight for treating 
chronic renal failure patients or about 10-30 units/kg body weight for treating 
HIV-infected patients with serum Epo levels less than or equal to 500 mU/ml 
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(milliunits/milliliter) who are receiving a dose of AZT equal to, or less than, 
4200 mg/week. 

Since the Epo muteins of the invention can be used effectively at lower 
dosages relative to wild type Epo, their use may reduce potential adverse 

5 effects associated with administration of Epo such as exacerbation of 
hypenension, seizures, headaches, tachycardia, nausea, clotted vascular 
access, shortness of breath, hyperkalemia, and diarrhea (see PDR, 1993 
edition, at pp. 603-604). 

Further description of methods and materials useful in the recombinant 

10 production of the Epo muteins of the invention and their use is provided in the 
teachings of U.S. Patent No. 4,835,260 by Shoemaker and U.S. Patent 
No. 4,703,008 by Lin, both of which are herein incorporated by reference in 
their entirety. 

The invention may be better understood and appreciated by reference 
15 to the following examples. These examples are provided merely for 
illustrative purposes and should not be considered in any way as limiting the 
scope of the claimed invention. 



EXAMPLE I 



Erythropoietin Structure-Function Relationships: 
20 High Degree of Sequence Homology Among Mammals 



Summary 

In order to investigate structure-function relationships of erythropoietin 
(Epo), we have obtained cDNA sequences that encode the mature Epo protein 
of a variety of mammals. A first set of primers, corresponding to conserved 
25 nucleotide sequences between mouse and human DNAs, allowed us to amplify 
by PGR intron 1-exon 2 fragments from genomic DNA of hamster, cat, lion, 
dog, horse, sheep, dolphin and pig. Sequencing of these fragments permitted 
the design of a second generation of species-specific primers. RNA was 
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prepared from anemic kidneys and reverse-transcribed. Using our battery of 
species-specific 5' primers, we were able to successfully PCR amplify Epo 
cDNA from Rhesus monkey, rat. sheep, cat and pig. Deduced amino acid 
sequences of mature Epo proteins from these animals, in combination with 

5 known sequences for human. Cynomolgus monkey and mouse, showed a high 
degree of homology, which explains the biological and immunological cross 
reactivity that has been observed in a number- of species. Human Epo is 91 % 
identical to monkey Epo. 85% to cat Epo and 80-82% to pig, sheep, mouse 
and rat Epos. There was full conservation of a) the disulfide bridge linking 

10 the NHj and COOH termini; b) N-glycosylation sites; and c) predicted 
amphipathic a-hcliccs. In conu^t, the short disulfide bridge (C29/C33 in 
human) is not invariant. Cys33 was replaced by a Pro in rodents. Most of 
the amino acid replacements were conservative. The C-terminal part of the 
loop between the C and D helices showed the most variation, with several 

15 amino acid substitutions, deletions and/or insertions. Calculations of 
maximum parsimony for intron 1/exon 2 sequences as well as coding 
sequences enabled the construction of cladograms that are in good agreement 
with known phylogenctic relationships. 

INTRODUCTION 

\ 

A large number of eariy physiologic studies have established extensive, 
though not complete, biological cross-reactivity between Epos of man and a 
number of other mammals including mouse, rat, sheep and rabbit (Jelkmann, 
W., Pfvfsiol. Rev. 72:449 (1992); Garcia, J.F. et al., Proc. Soc. Exp. Biol. 
Med. 772:712-714 (1962); Zanjani, E.D. et ai, Proc. Soc. Exp. Biol. 
Med. 757:1095-1098 (1969); Biljanovic-Paunovic, L. eta!.. Period. Biol. 
90:421-421 (1988)). !r. contrast, ncn mammalian vertebrates (amphibians 
(Rosse, W.F., et al., Blood 22:66-72 (1%3); Pindcr. A. et al., J. Exp. Biol. 
705:205-213 (1983)), birds (Rosse, W. etal., Blood 27:654-661 (1966); 
Black, CP. etal., Respirat. Physiology 59:217-239 (1980)), fish (Zanjani, 
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E.D.. e( ai. Science 33 ■.5'13 (1969)) have erythropoietic hormones that fail to 
cross- react with mammalian erythroid cells, and vice-versa (Rosse, W.F., 
etaL. Blood 22:66-12 (1963); Pinder, A. et ai, J. Exp. Biol. 705:205-213 
(1983); Rosse, W. el al., BW 27:654-661 (1966)). 
5 Thus far, the Epo genes of man (Jacobs, K. et al. , Nature 375:806-10 

(1985) ; Lin. F.-K. et al., Proc. Natl. Acad. Sci. USA 82:7580-7584 (1985)), 
a monkey {Macaco fascicularis) (Lin, F.-K. et al.. Gene 44:2Q\-209 (1986)) 
and a rodent, the mouse (McDonald, J.D. et al., Mol. & Cell Biol. 6:842-848 

(1986) ; Shoemaker, C.B. etai, Mol. & Cell Biol. 6:849-858 (1986)) have 
10 been cloned, sequenced, and expressed. In view of the marked cross- 
reactivity between mammalian Epos, it is not surprising that there is a high 
degree of sequence homology in the coding region of the mature secreted 
proteins. In keeping with their close phylogenetic relationship, human and 
monkey Epos are 94% and 91% identical in nucleotide and amino acid 

15 sequence respectively. In contrast, human and mouse Epos are 76% identical 
in nucleotide sequence and 80% identical in amino acid sequence. 

Direct information on the three-dimensional structure of Epo is not yet 
available. Insights into structure-function relationships of Epo can be gained 
from the analyses of a more complete set of animal sequences. Such 

20 information could be useful for sequence-based computer modeling of three- 
dimensional structure. Moreover, a larger data base would permit the 
identification of highly conserved domains that are likely to be crucial to 
folding and/or biological function. Finally, comparative amino acid and 
nucleotide sequence information on Epo provides additional data for 

25 investigating phylogenetic relationships. 

In this Example, the use of PCR based techniques for obtaining the full 
coding sequences of Epos of Rhesus monkey, rat, sheep, pig and cat, as well 
as "artia! s?"u?nc?s frotri four othrr species is demonstra'r'i 
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Materials and Methods 

Animal Samples. Human hepatoma Hep3B cell line and multiple 
kidney-derived cell lines from hamster (BHK), sheep (MDOK), pig (LLC- 
PKl), dog (MDCK), cat (CRF-K). lion (PALl-K) and spotted dolphin (SPl-K) 
5 were obtained through the American Type Culture Collection (American Type 
Culture Collection, 12301 Parklawn Drive. Rockville, MD 20852). 
Monolayer cells were grown in 100 x 20 mm tissue culture dishes using 
recommended media and maintained in a humidified 5% COJ95% air 
incubator at 37°C. In some experiments, cells were made hypoxic by 

10 overnight incubation in 1 % 0;, 5% COj, 94% N, at 3T'C. 

Poly A + RNA prepared from Rhesus monkey kidney was purchased 
from Clontech (Palo Alto, CA). Kidneys from anemic cat and horse were 
obtained from veterinarians at Tufts School of Veterinary Medicine and the 
University of Nebraska, respectively. Anemia was induced by three 

15 consecutive daily intraperitoneal injections of phenylhydrazine (60 mg/kg body 
weight) into male Sprague-Dawley rats (SD-strain) and by repeated bleeding 
of the sheep and the pig. Kidneys were ascptically removed after induction 
and stored immediately in liquid nitrogen. 

A 4 kb genomic human Epo clone (g Epo4) was provided by Genetics 

20 Institute (Boston, MA). 

DNA and RNA Preparations. DNAs from cell lines or homogenized 
kidneys were extracted using pancreatic RNAasc/SDS/proteinase K following 
a procedure modified from Blin and Stafford (Blin, N. et al., Nucleic Acids 
Res. i:2303-2308 (1976)). 

25 Kidneys were homogenized in 4 M guanidine isothiocyanate (10 ml/g 

of frozen organ), containing 0.1 M beta-mercaptoethanol and 10% N-lauryl- 
wrposine. Total R.NAe were isolated by cer.trifugaticr. over 5.7 N! CsCl 
(Chirgwin. J.M. ei ai, Biochemistry 75:5294 (1979)). After ethanol 
precipitation, the samples were resuspended in diethyl pyrocarbonatc-trcated 

30 water and stored at -70°C. Confluent monolayer cells were washed twice with 



wo 94/24160 



PCT/US94/04361 



Sterile phosphate buffered saline and directly iysed in the 4 M guanidine 
isothiocyanate solution. Total RNAs were isolated as described above for the 
kidneys. 

RNA samples were first converted into single strand cDNA. 2 to 4 /xg 

5 of total RNA from kidney or 500 ng to 1 fig of total RNA from cultured cells 
were denatured at 68°C in the presence of 2 fig of oligo dT<,j,. Reverse 
transcription was carried out in a 20 ^1 final volume, containing 50 mM Tris- 
HCl pH 8.3. 6 mM MgCU, 40 mM KCl. 10 mM DTT. 500 ;xM dNTPs. 20 
u RNAsin (Promega. Madison, Wl) and 20 u of avian myeloblastosis virus 

10 reverse transcriptase (AM V-SuperRT. Molecular Genetic Resources. Tampa. 
FL). The reaction was allowed to proceed at ST'C for 20 minutes, then 
at 42*'C for one hour. Inactivation of the enzyme was performed at lOO^C 
for 10 minutes. cDNA samples were stored at -lO'C. 

Polymerase Chain Reaction. 20 to 100 ng of genomic-DNA or 0.5 

15 to 1 fil of the RT-reaction were used as templates for amplification. 

Standard PGR reactions were performed in a 100 volume 
containing 50 mM Tris-HCl pH 8.3, 50 mM KCl. 1.5 mM MgClj. 0.01 % w/v 
gelatin, 200 ^M dNTPs, 30 pM of each sense (5') and antisense (3') primers 
and 0.5 units of Taq DNA polymerase (Perkin Elmer Cetus, Emeryville. CA). 

20 After an initial denaturation step for 5 minutes at 95 "C. routinely 35 cycles 
of PCR were performed on the DNA Thermal Cycler (Perkin Elmer Cetus). 
The denaturation step of each cycle was carried out at 94 °C for 1 min. For 
each sample and/or the primer pair used, annealing temperatures were 
optimized (from 45 "C to 60°C) for a 2-minute reaction. A 3-minute 

25 elongation step at 72°C ended each cycle. 

Amplification products were analyzed on 1 to 3% agarose gel-TAE. 
Nu Sieve or Seaplaque agaroses (FMC, Rockland, ME) were used for 
nr?"arative purposes. Specific PCR products were recovered from Lhe gel, by 
the use of the Gene Clean 11 kit (Bio 101, La JoUa. CA). by Spin X 

30 centrifugation (Costar, Cambridge, MA) or by standard phenol/chloroform 
extractions of melted gel slices. 
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Subcloning and Sequencing. Small (250-330 bp) amplified genomic 
fragments were directly subjected to automated sequencing, using the PCR 
primers as DNA sequencing primers. 

PCR-generated cDNA products were blunt-ended using Klenow 

5 fragment and 5' phosphorylated by T4 DNA kinase (Boehringer Mannheim. 
Indianapolis, IN). They were then cloned into the Smal site of the phagcmid 
pBluescript II SK (Stratagene, La Joila, CA). The double-stranded plasmid 
was sequenced using the SK and KS sequencing primers. 

In all cases. Sequencing reactions were carried out on an Applied 

10 Biosysiems 373 A Automated DNA Sequencer utilizing ABl's DyeDeoxy 
Terminator Sequencer kit (Foster City, CA) and thermal cycling with Taq 
DNA polymerase (Promcga, Madison, WI) as previously described (Tracy, 
T.E. eiai. Biotechniques ll:6i-7A (1991)). To avoid errors, for each 
species, samples derived from different amplificaiion/cloning experiments were 

15 prepared and both strands were sequenced. 

Mammalian Expression; Bioactivity. Vectors containing full-length 
mammalian Epo coding sequences were subcloned into pSG5 plasmid 
(Pharmacia, Piscataway, NJ) and were transiently expressed in Cos7 cells. 
72-hour supematants were tested for their ability to sustain cellular 

20 proliferation of the Epo-dcpcndeni HCD57 cell line (Sawyer, S.T. et al. 
Blobd 72 (suppl. 1): 132a (1988)). 

Results and Discussion 

Because of the known biological cross-reactivity between various 
mammalian Epos and the presumption of strong homology in the coding 
25 sequences, a logical cloning strategy would be to screen cDNA libraries with 
moderate lo iow stringency. However, Epo mRNA is expressed at bareiy 
detectable levels in all tissues except in hypoxic kidney and liver (Goldberg, 
M.A. <;/ ai, Proc. Natl. Acad. Sci. USA 84: 7972-7976 (1987); Fandrcy, J. 
etai. Blood 81: 617-623 (1993)). Therefore it would be necessary to 
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generate libraries for each of the species of interest. Our pnmary research 
interest is in structure-function relationships of Epo and therefore we require 
full coding sequences of the mature Epo protein, rather than full cDNA 
sequences. Accordingly, we elected to employ a PCR cloning strategy 

5 predicated on known strong homologies in Epo cDNA sequences that flank the 
region encoding the mature protein. There was sufficient conservation of 
sequence around the NCOl site in the 3' untranslated region (UTR) that a 
single primer could be used to amplify full length mature Epo coding sequence 
in rat. sheep, pig and cat as well as dog and rabbit (data not shown). 

10 However, we were unsuccessful in finding a universal 5' primer and therefore 
had to generate species specific sequences from amplifications of genomic 
DNA. This two stage PCR strategy proved to be satisfactory in generating 
highly accurate sequence information with a minimum expenditure of time and 
resources. In all instances, there was full agreement between sequences of 

15 complementary strands. 

Preparation of Mammalian Epo cDNAs 

Primer Design for PCR Amplification of Epo. The sequences of Epo 
genes from three species have been already published: two primates, human 
(Jacobs, K. et al., Nature 31 3:i06-m (1985); Lin, F.-K. et ai, Proc. Nail. 

20 Acad. Sci. USA S2:7580-7584 (1985)) and Cynomolgus monkey {Macaco 
fascicularis) (Lin, F.-K. et al.. Gene 44:201-209 (1986)) and a rodent, the 
mouse (Mm muscuius) (McDonald, J.D. et al.,Mol. & Cell Biol. 6:842-848 
(1986); Shoemaker, C.B. et al.,Mol. A Cell Biol. 6:849-858 (1986)). There 
is substantial homology at the nucleotide level not only in the coding 

25 sequences, but also in portions of introns and in 5' and 3' untranslated 
w'.niipfv*"; of exnns 1 and S rMnectively A number of primer";, 
corresponding to conserved nucleotide sequences between mouse and-human, 
were synthesized and tested for their ability to produce genomic-PCR 
fragments from a wide variety of mammalian cell lines. A 4 kb human 
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genomic clone and genomic DN A extracted from the human hepatoma cell line 
Hep3B were used as control templates for the PCR amplifications. 

In particular, two pairs of primers, IV1/EX2R and EX5/NC01 
(Figure 1), directed the amplifications of fragments of 330 and 250 bp 

5 (respectively) from genomic DNA of all human and mammalian cell lines that 
we investigated. IVl is a 25-mer oligonucleotide localized in intron 1, 
216/217 bases upstream from the stan of exon 2. Human and mouse are 
identical except that the human sequence contains an extra G nucleotide at 
position 996. IVl primer corresponding to the mouse sequence was 

10 synthesized. The 23-mer EX2R ends 28 nucleotides upstream from the 3' end 
of exon 2. EX5 is a 22-mer, beginning 34 bases downstream from the 5' end 
of exon 5. NCOl represents a 20 bp, 100% conserved DNA fragment, 
starting 112/113 bases downstream from the TGA stop codon and contains the 
unique NCOl site present in the human and mouse gene. 

15 An ATG primer was also synthesized, corresponding to a 20 

oligonucleotide stretch, extending both 5' and 3' from the initiator methionine 
codon. This sequence is also totally conserved between primates and mouse. 
While combinations of exonic primers (ATG1EX2R, EX5/NC01 and 
ATG/NCOl) were able to direct amplification from cDNA prepared from 

20 RNA of the human Epo-producing cell line Hcp3B, no amplification was 
obtained on cDNA prepared from uninduced or hypoxia-induced BHK, 
MDOK, LLC-PKl, MDCK, CRF-K, PALl-K and SPl-K cells (results not 
shown). Therefore these renal cell lines apparently do not express Epo 
mRNA, either constitutivcly or after hypoxic stress, at a level detectable even 

25 by RT-PCR. Furthermore, no Epo protein was detectable by 
radioimmunoassay in the supematants of cells maintained overnight in 1 % 0,, 
5% CO2. 94% N, at 37°C (Goldberg, M.A. etai, Proc. Natl. Acad. Sci. 
USA W:7972-7976 (1987)). 

The amplification of species specific IV1/EX2R and EX5/NC01 

30 genomic fragments allowed us to design a strategy shown in Figure 2 for 
obtaining cDNA clones containing the complete coding sequence of mature 
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Epo protein from various mammals. The IV1/EX2R primer pair resulted in 
a genomic fragment ( - 330 bp) corresponding to the 3' third of the first iniron 
and 80% of exon 2, encoding the COOH-end of the propeptide and the NH,- 
tenninal amino acids of the mature protein. EX5/NC01 primers amplified a 
5 segment of exon 5, containing the COOH-end of the Epo protein and a 3' 
untranslated sequence of about 140 bp downstream of the stop codon: DNA 
sequencing of these 5' and 3' fragments permitted, if necessary, the design of 
a second generation of species-specific primers (SP), localized outside the 
coding part of the mature protein (i.e., in the sequences encoding the 

10 propeptide and the 3' untranslated region). 

Amplification of Partial Intwn 1/ Exon 2 Genomic Clones. We first 
explored the efficacy of the IV 1 and EX2R primers for amplification of 
genomic fragments from different purified DNA templates. In all the 
reactions, fragments of predicted size were the main (and, when using the 

15 most stringent annealing temperature, often the only) products detected by 
agarose gel electrophoresis. The (direct) sequences of these PCR-generated 
fragments are presented in Figure 3. The nucleotide alignment includes 6 
different orders of mammals: one primate, human (Homo sapiens); two 
aniodactyls, sheep {Ovis aries) and pig {Sins scrofa); one perissodactyl, horse 

20 {Equus caballus); one cetacean, spotted dolphin {Stenella plagiodon); three 
carnivores, dog (Canis familiaris), cat {Felis catus) and lion (Panihera lea); 
two rodents, mouse {Mus musculus) and hamster {Cricetus ciicetus). Cat and 
lion exhibited an almost complete sequence identity, with only one T to G 
nucleotide substitution near the 5' end of the amplified portion of the first 

25 intron. 

The ability of the intronic IV 1 to anneal to genomic DNA from various 
species and, in combination with EX2R, to amplify PCR fragments of identical 
size, demonstrated a high degree of conservation of sequence and nnsitinn 
between mammals. This suggests that IV 1 sequence may be involved to some 
30 extent in the regulation of the Epo gene expression. The comparison of 
IV1/EX2R sequences also revealed two remarkably conserved intronic 
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sequences: AT{T/A)GAATGAA(G/C)GC [SEQ. ID NO; 1] (nucleotides 1046 
to 1058 in the human gene) and A(A/T)GGTN(G/C)GGG [SEQ. ID NO: 2] 
(nucleotides 1085 to 1094). 

Amplification of Partial Exon 5 Fragments from Genomic DNAs. 

5 PCR reactions were also performed ^plying the EX5/NC01 primer pair to 
various purified genomic DNAs. As previously observed for the amplification 
of IV1/EX2R, in all the tested samples a unique main band of about 250 bp 
(as predicted for human and mouse genes) was detected on analytical agarose 
gel electrophoresis. Direct DNA sequencing of these fragments demonstrated 

10 that the bulk of the purified PCR product corresponded to the expected Epo 
exon 5 sequence. However, for some DNA samples, purified from horse 
kidney and several cell lines (BHK and LLC-PKl in particular), unknown 
sequences were present to various extents. These minor contaminating PCR 
products generated extraneous peaks in the sequence data, resulting in 

15 ambiguities at several positions. Nevertheless, computer-edited analyses of 
generated 3' non-coding sequences were sufficient to design, if necessary, 
specific primers with greater than 95 % accuracy (data not shown). 

Cloning of Partial cDNAS Encoding the Complete Mature Epo 
Protein. As we previously mentioned, we were unable to obtain any 

20 amplification with cDNAs prepared from renal-derived mammalian cell lines. 
Therefore, we tested our battery of primers on reverse-transcribed RNAs 
prepared from kidney of several mammals. We have successfully amplified 
Epo cDNA from: one primate. Rhesus monkey (Macaca muUata), one 
carnivore, cat {Felis caius), one rodent, rat {Rattus norvegicus) and two 

25 aitiodactyls, pig {Sus scrofa) and sheep {Ovis aries). The aligned nucleotide 
sequences are presented in Figure 3. They have been submitted to GenBank 
and have been assigned accession numbers. 

Rhesus monkey, rat and sheen cDNAs were amplified ii<;ing the 
ATG/NCOl primer-pair combination. Pig and cat sequences were obtained 

30 by the use of a 5' 24-mer specific primer (SPl-5' 
TGCTTCTGCTATCTTTGCTGCTGC 3') [SEQ. ID NO: 3]. IV1/EX2R 
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sequencing demonstrated that this exon 2 sequence was shared by the two 
species. In all PCR amplifications, NCOl was used as the reverse primer. 
Conservation of the NCOl sequence among mammals suggests a potential 
biological role of this nucleotide sequence (perhaps in modulating the stability 
5 of Epo mRNA) (Rondon, I. J. etal., J. Biol. Chem. 266:16594-16598 
(1991)). Considerable homology is also found in the sequenced 3' 
untranslated segment of exon 5 (Figure 4). 

When compared with human Epo, the overall percentages of nucleotide 
identity are, respectively: 93.5 for the Rhesus monkey, 86.7 for the cat, 85.7 

10 for the pig, 83 for the sheep and 75.5 for the rat. Rhesus monkey and 
Cynomolgus monkey (Lin, F.-K. et ai, Gene 4-^:201-209 (1986)) were 99.6% 
identical. Mouse (McDonald, J.D. etai, Mol. & Cell Biol. 6:842-848 
(1986); Shoemaker, C.B. et al.,Mol. & Cell Biol. 6:849-858 (1986)) and rat 
nucleotide sequences showed greater than 93% identity. The two sequenced 

15 aniodactyls, pig and sheep, showed 88% identity.' 

Comparison of Mammalian Epo Primary Sequences 

Propeptides. The predicted amino acid propeptide sequences of several 
mammals are presented in Figure 5. In the primates, there was strong 
conservation of the sequences of deduced propeptides, with only two amino 

20 acid (aa) substitutions at positions -11 (Leu in humans vs. Val in both 
monkeys) and -2 (Leu in humans vs. Pro in both monkeys). Expression of 
Cynomolgus monkey Epo gene in cultured mammalian cells resulted in the 
production of maftire monkey Epo that was elongated at the N-terminus by 
three additional residues: Val-Pro-Gly (Lin, F.-K. et al.. Gene 44:201-209 

25 (1986)). As the Rhesus monkey has the same substitutions, an identical site 



' Several PCR attempts on total RNA purified from the horse's kidney 
were unsuccessful, even when using 5' and 3' ^ific equine primers (SP). 
presumably because analysis on 1 % formaldehyde agarose gel demonstrated 
the degradation of the RNA. 
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of cleavage is likely. The Leu to Pro amino acid replacement at -2 is 
probably respwnsible for the differential activity of the signal peptidase 
observed between man and monkeys. The three rodents, mouse, rat. and 
hamster, were found to have identical propeptide amino acid sequences. 
5 Mature protein. A comparison of the three previously published and 

our five deduced amino acid sequences of mature ^x> proteins is shown in 
Figure 6. Erythropoietin is highly conserved among mammals. The amino 
acid alignment showed that more than 63% of the molecule is composed of 
invariant amino acids (106 residues) (Figure 6A). Most of the observed 
10 substitutions are conservative, involving residues with similar physical and 
chemical properties. The calculated percentages of sequence identity are 
shown in Table II below. 



Table n 

Degree of conservation among mammals 
of the mature Epo proteins 

The percentages of identity among the various sequences were 
determined from the amino acid alignment reported in Figure 6. 





% identity 


Primates 


Man/Rhesus Monkey 


90.5 




Rhesus/Cynomolg\is 


98.8 


Rodents 


man/mouse 


79.8 




man/rat 


82.1 




mouse/rat 


93.5 


Artiodactyls 


man/sheep 


81.0 




man/pig 


81.6 




sheep/pig 


88.7 


Carnivores 


manVcat 


84.5 



There is 100% conservation of Cys7 at the N-terminal portion of the 
polypeptide and Cysl61 near the C-ierminus. This disulfide bridge is essential 
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to the formation of a stable and functional cytokine fWang, F.F. etaL, 
Endocrinology 7/6:2286-2292 (1985)). In contrast, while Cys29 is invariant 
in all the species we examined, Cys33 is present only in primates, sheep, pig 
and cat, but not in the two rodents where there is a proline. The lack of 
5 functional importance of this short disulfide loop is underscored by muteins in 
which tyrosine substitutions at each or both of these sites have no effect on 
Epo's biological activity (See Example II). 

There is also 1(X)% conservation of all three asparagine residues 
responsible for N-linked glycosylation. Even though Epo, deprived of its 

10 carbohydrate cither by enzymatic cleavage (Takeuuchi, M. etal., J. Biol. 
Chem. 265:12127-12130 (1990)) or by production in bacteria (Nahri. L.O. 
et ai, J. Biol. Chem. 266:23022-23026 (1991)) has full in vitro biological 
activity, survival of the hormone in the circulation depends upon N-linked 
glycosylation (Spivack, J.L. et al.. Blood 73:90-99 (1986)). In contrast, the 

15 0-linked glycosylation site (Serl26 in human Epo) is not essential for 
biological function since it is missing in mouse and rat Epos. 

As shown in Figure 6, there is very high conservation of sequence in 
regions which, by algorithms based on primary structure, are predicted to be 
alpha helices (See Example II). The few differences in sequence within these 

20 helical regions are conservative replacements. Indeed, some sites within these 
predicted alpha helices are conserved in corre^nding helices of other 
cytokines (Manavalan. P. etal., J. Proi. Chem. 77:321-331 (1992)). In 
contrast, regions predicted to be inter-helical loops are less well conserved, 
particularly the 14 residue stretch between residues 116 and 130 of human 

25 Epo. In Example II a mutein with a deletion of residues 1 1 1 through 1 19 is 
shown to have normal stability and biological activity, whereas a deletion of 
residues 122 through 126 fails to produce a detectable protein, probably owing 
to markedly impaired stability. 

Similar overall 4 a-helical bundle structure exists or is predicted for 

30 several other cytokines/hormones, such as growth hormone (GH), granulocyte- 
macrophage colony-stimulating factor (GM-CSF), IL-3, ILrA and IL-5 (Bazan, 
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J.F. et al., Immunol. Today 11: 350-354 (1990)). Like Epo. GH exhibits a 
high degree of primary sequence conservation and biological crossreactivity 
between mammals. The mouse, rat, pig and sheep proteins show about 80% 
amino acid identity with human GH. In other cytokines, however, there is 
more sequence diversity among species. For example, the human and murine 
forms of IL-5, GM-CSF and IL-3 show respectively only 69, 67 and 26% 
amino acid homology. Furthermore, unlike Epo or GH, those proteins lack 
inter-species biological cross-reactivity. 

Expression of the Mammalian Epo Hormones. Mammalian Epo 
cDNAs were subcloned into the pSG5 plasmid aiwi transiently expressed in the 
monkey Cos? cell line. Supeniatants of the transfected cells were able to 
sustain the cellular proliferation of the murine Epo-dependent HCD57 cell line, 
demonstrating the biological cross-reactivity between species. As the human 
Epo antisera used in our radioimmunoassay bind to Epos from other species 
with variable affinity, the amount of produced protein was difficult to 
determine accurately. Experiments are in progress to further characterize the 
relative affinities of the various mammalian erythropoietins toward murine and 
human Epo receptors. 

Phylogenetic Analyses of Epo Sequences. Nucleotide sequences 
encoding the full length mature Epo protein were analyzed by the maximum 
parsimony method (Crelusniak, J. et aL,Meth. Enzymol. 785:601-615 (1990); 
Stanhope, M.J. et al.. Mot. Phylog. Evol. 7:148-160 (1992)). An "all trees" 
set of computer programs determined the parsimony length of each of the 945 
unrooted trees formed by the seven cDNA sequences (the terminal laxa) 
(Figure 4) and, on ordering those 945 trees according to increasing length, 
identified the minimum number of extra nucleotide substitutions needed to 
break up each group in the maximum parsimony (lowest length) tree. As 

human and monkey. The pig and sheep are more weakly grouped, in accord 
with accepted phylogenetic views, which place the divergence of these animals 
about 55-60 million years ago. As expected with the parsimony reconstruction 
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on only the codon sequences, silent base substitutions occurred much more 
readily than amino acid replacements (Figure 7B). 

The Epo intron 1-exon 2 sequences shown in Figure 3 were also 
analyzed by the maximum parsimony method. Figure 8 shows strength of 

5 grouping results based on all 135, 135 unrooted trees formed by nine terminal 
taxa (in this case, cat and lion sequences, which are nearly identical, were 
treated as a single taxon), and Figure 8A shows a phylogenetic tree that 
combines maximum parsimony with established phylogenetic evidence. As the 
most parsimonious tree (Figure 7) for these relatively short sequences failed 

10 to join dog to the feloid (cat and lion) branch (reflecting the fact that the 
ancestral split between canoids and feloids traces back to the early carnivores 
at about 60 million years ago), the phylogenetic tree shown in Figure 7A is a 
near most parsimonious tree in which dog joins the feloids. The rodents 
(mouse and hamster) are strongly grouped, as are the feloids (cat and lion), 

15 and at more moderate strength the cetacean (dolphin) is grouped with the 
artiodactyls (sheep and pig). There are other molecular data as well as some 
paleontological data that depict cetaceans originating from early artiodactyls 
(Czelusniak, J. et al. Meth. Enzymol. 7Si:60I-615 (1990); Czelusniak, J. 
et ai. In "Current Mammology", Volume 2, 3rd ed., H. H. Ganoways, 

20 pp. 545-572, Plenum Publ. Corp. (1990); Irwin, D.M. et al., J. Mol. 
Evol. 52:128-144 (1991); Gingreich, P.D. et al, Science 249:\5A-\51 {\990)\ 
Novacek, M.J., Nature 356:121-125 (1992)). 
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EXAMPLE II 

Structure Function Relationships of Erythropoietin: 
Muteins that Test a Model of Tertiary Structure 

Summary 

5 On the basis of its primary sequence and the location of its disulfide 

bonds, we propose a structural model of the erythropoietic hormone 
erythropoietin (Epo) which predicts a 4 a-hclical bundle motif, in common 
with other cytokines. In order to test this model, site directed mutants were 
prepared by high level transient expression in Cos7 cells and analyzed by a 

10 radio-immune assay and by bioassays utilizing mouse and human Epo 
dependent cell lines. Deletions of 5 to 8 residues within predicted a helices 
resulted in the failure of export of the mutant protein from the cell. In 
contrast, deletions at the N-terminus (A 1-6), the C-tcrminus (A 163- 166), or in 
predicted interhelical loops (AB: A32-36, A53-57; BC: A78-82; CD: 

15 All 1-1 19) resulted in the export of immunologically detectable Epo muteins 
that were biologically active. The mutein A48-52 could be readily detected 
by RIA but had markedly decreased biological activity. However, 
replacement of each of these deleted residues by serine resulted in Epo 
muteins with full biological activity. Replacement of Cys29 and Cys33 by 

20 tyrosine residues also resulted in the export of fully active Epo. Therefore this 
small disulfide loop is not critical to Epo's stability or function. The 
properties of the muteins that we tested are consistent with our proposed 
model of tertiary structure. 
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Introduction 

Humoral regulation of red blood cell production was first proposed at 
the beginning of this century (Camot, P., et ai, C.R. Seances Acad. 
Sci. 745:432-35 (1906)). Convincing physiologic experiments documenting 
5 the existence of erythropoietin (Epo) (Krumdieck, N.. Proc. Soc. Exp. Biol. 
Med54:\*-\7 (1943); Rcissmann, K.R., Blood 5:312-3iO (1950); Erslev, A., 
Blood 5:349-357 (1953); Jacobson. L.O.. et ai. Nature ] 79:623-634 (1957)) 
were followed by its purification (Miyake. T., etal.. J. Biol. 
Chem. 252:5558-5564 (1977)) and partial structural characterization (Lai. P.- 
10 H., et ai, J. Biol. Chem. 267:31 16-3121 (1986)). The molecular cloning of 
this biologically and clinically important cytokine (Jacobs, K., etal., 
Nature 3 }3:806-m (1985); Lin, F.-K., etal., Proc. Nat. Acad. Sci. 
USA 52:7580-7585 (1985)) has led to further understanding of its properties 
(Sasaki, H.. etal., J. Biol. Chem. 262:12059-12076 (1987); Davis, J.M., 
15 et ai. Biochemistry 26:2633-2638 (1987)). 

The binding of Epo to its cognate receptor (D' Andrea, A.D., et al, 
Ce// 57:277-285 (1989)) on erythroid progenitors in the bone marrow results 
in salvaging these cells from apoptosis (Koury, M.J., et al., 
Science 245:378-381 (1990)), allowing them to proliferate and differentiate 
20 into circulating erythrocytes. The ^ receptor is a member of an ever 
enlarging family of cytokine receptors (Bazan, F., Biochem. Biophys. Res. 
Commun. 764:788-796 (1989)). In like manner, Epo shares weak sequence 
homology with other members of a family of cytokines which also include 
growth hormone, prolactin, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, GCSF, GM- 
25 CSF, M-CSF, oncostatin M, leukemia inhibitory factor and ciliary 
neurotrophic factor (Parry, D.A.D., etal.. J. Mol. Recognition 7:107-110 
(1988); Bazan, F., Immunology Today 77:350-354 (1990); Manavalan, P.. 
etal.. J. Protein Chem. 77:321-331 (1992)). The genes encoding these 
proteins have similar numbers of exons as well as a clear relationship between 
30 intron-exon boundaries and predicted a-helical structure. These similarities 
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have led to the prediction that this family of cytokines share a common pattern 
of folding into a compact globular structure consisting of four amphipathic 
a-helical bundles. Such theoretical models of the structures of human growth 
hormone (Cohen, F.E., et al., Proteins: Struct. Func. Genet. 2:162-166 

5 (1987)) and IL-4 (Curtis, B.M., et al., Proteins: Struct. Func. 
Genet. 77:111-119 (1991)) have been in remarkably good agreement with 
subsequent structures established by X-ray diffraction (HGH) (Abdel-Meguid, 
S.S., et al.. Proc. Natl. Acad. Sci. USA 54:6434.6437 (1987); deVos, A.M.. 
etal., Science 255:306-312 (1992)) or by multidimensional NMR (IL-4) 

10 (Redfield, C. , et al., Biochemistry 30: 1 1029-1 1035 (1991); Powers, R. , et al., 
Science 256:1673-1677 (1992)). Moreover, the crystal structures of GM-CSF 
(Diederichs, K., et al. Science 254:\119-m2 (1991)) and monomeric M- 
CSF (Pandit, J., et al.. Science 255:1358-1362 (1992)) are also in reasonable 
agreement with their predicted structures. 

15 Thus far, the structure of Epo has not been analyzed by either X-ray 

diffraction or by NMR. In order to begin to gain an understanding of 
structure-function relationships, we have taken a three-pronged approach: 

a) Sequence determination of Epo from mammals of different 
orders in order to establish regions of homology. This work is described in 

20 Example I. 

b) Construction of a model of the three-dimensional structure of 
Epo, followed by the design and preparation of muteins that test this model. 
These experiments are presented in this Example. 

c) Design and testing of muteins that provide information on 
25 receptor binding domain(s). This work is presented in Example III. 
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Materials and Methods 

Computer-based Modeling of Structure 

Prediction of Secondary Structure. Epo sequences from human, 
monkey, mouse, rat, sheep, pig. and cat were aligned (See Example I), and 

5 examined using a hierarchical approach to secondary structure prcdicuon that 
assumes that these proteins are members of the a/a folding class (Uvitt, M. 
et d.. Nature 267:552-558 (1976)). First, the pattern-based method of Cohen, 
F.E. et al., Biochemistry 25:266-275 (1986) for turn prediaion was used to 
delimit sequence blocks likely to contain secondary structure. Predictions using 

10 the methods of Gamier, J. et at.. J. Mol. Biol. 120:91-120 (1978) and Chou, 
P.Y. etaL, Ann. Rev. Biochem. 47:251-276 (1978) suggested o-helical 
regions within these blocks. Finally, helical wheel projections were used to 
examine and then limit helix length based on preserving amphipathic character 
as codified in the work of Presnell, S.R. etaL. Biochemistry 57:983-993 

15 (1992). The locations of glycosylation sites were also used to suggest helix 
boundaries. 

Tertiarj Structure Prediction. Earlier investigations have revealed the 
general principles of helix-to-helix packing in globular proteins (Richmond. 
TJ'., etaL, J. MoL BioL 779:537-55 (1978)). Exploring these principles. 

20 Cohen. F.E. et al.. J. MoL BioL 752:275-288 (1979) developed a method for 
the generation of three-dimensional protein structures from the secondary 
structure assignment. These methods have been applied to myoglobin, tobacco 
mosaic virus coat protein, growth hormone, a- and (3-interfcron, IL-2,and IL-4 
(Cohen. F.E., ei al.. J. MoL BioL 752:275-288 (1979); Cohen, F.E., et aL. 

25 Science 254:349-352 (1986); Cohen. F.E., etaL. Proteins: Struc. Func. 
Gen. 2:162-166 (1987); Sternberg. M.J.E., etaL. Int. J. Biological 
Macromolecules 4:137-144 (1982)). 
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The algorithm for tertiary structure generation is divided into four 
computations. The program aapatch identifies clusters of hydrophobic residues 
within the putative helices that could mediate helix-helix interactions 
(Richmond, T.J., et aL J. Mol. Biol. 779:537-55 (1978)). Aafold generates 

5 ail possible helix pairings according to the location and geometric preferences 
of the interaction sites. Aabuild generates the three-dimensional models of all 
possible structures from the list of helix pairings (from aafold) and subject to 
stearic restrictions and geometric constraints on chain folding. In the final 
step, aavector applies the user defined distance constraints (e.g., disulfide 

10 bridges) to the structures generated. At this stage, coordinates have been 
specified only for residues in the core o-helices. For residues in sequentially 
distinct loops, lower bounds on the inter-residue distances can be inferred from 
the relevant helix terminus. 

Preparation of Epo Muteins 

15 Construction of the Mutagemci Mammalian Expression Plasmid. A 

M13 plasmid, containing a 1.4 Kb EcoRl-EcoRI human Epo cDNA insert 
(XHEpo FL12) was a gift from Genetics Institute (Cambridge, MA) (Jacobs, 
Yi.,.et aL Nature J73:8O6-810 (1985)). A 943 bp EcoRI-Bglll fragment, 
corresponding to the complete coding sequence of the wild type human 

20 erythropoietin, including untranslated regions 216 bp upstream and 183 bp 
downstream, was inserted into the mammalian expression plasmid pSG5 
(Stratagene) (Sambrook, J., et al.. In "Molecular Cloning: A Laboratory 
Manual", 2nd ed.. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY. 
pp. 1.38-1.41 (1989)) and named pSG5-EpoAVT. 

25 Slit-Dlrecicu Muutgcucsis was earned Gut acccrding *c the protocc! 

described by Kunkel, T.A., et al., Meth. EnzymoL 754:367-382 (1987). 
Singlc-su-anded DNA was rescued from the pSG5-EpoAVT phagemid grown 
overnight in E. coli CJ236, in 2XYT media containing M13K07 helper phage 
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(In Vitrogen) and 70 ,xg/ml kanamacyn (Sigma). The resulting uracil- 
containing ssDNA was used as a template for mutagenesis. Oligonucleotides 
(24 to 46 mer) were synthesized with their 5' and 3' ends complementary to 
the target wild type Epo sequence. A large variety of mutations (base 
substitutions, deletions and insenions) were created at the centers of the 
mutagenic pnmer sequences. Annealing of the phosphorylated primers (10: 1 
oligonucleotide/DNA template molecular ratio) was performed in 10 ,xl of a 
20 mM Tris-HCl pH 7.4, 2 mM MgCl,, 50 mM NaCl solution. The reactions 
were incubated at 80-C for 5 minutes, then allowed to cool slowly to room 
temperature over a one hour period. The DNA polymerization was initiated 
by the addition as a mix of 1 ii\ of lOX synthesis buffer (100 mM Tris-HCL 
pH 7.4. 50 mM MgCl,. 10 mM ATP. 5 mM each dNTPs. 20 mM DID, 0-5 
^1 (8 units) of T4 DNA ligase and 1 ^1 (1 unit) of T4 DNA polymerase 
(Boehringer Mannheim). After 2 hours at 37°C. 80 ^1 of IX TE was added. 
5 ;xl Of the diluted reaction mix was used to transform competem E. coli 
NM522 (ung^ dut*). 

Since a 40-80% mutation yield is normally obtained. 4 to 5 double- 
stranded plasmid clones from each reaction were sequenced with 7-deaza- 
dGTP and Sequenase (U.S. Biochemical) (Sanger, F.. et al, Proc. Natl. 
Acad. Sci. USA 74:5463-5467 (1977)). As a rule, the entire coding sequences 
of the Epo mutants were examined for the presence of unwanted mutation by 
sequencing or restriction enzyme mapping. 

Production of WiJd Type Epo and Epo Muteins in Mammalian Cells. 
Cos7 cells grown to approx. 70% confluency were transfected with 10 ^tg 
recombinant plasmid DNA per 10 cm dish using the calcium phosphate 
precipitation protocol (Kingston. R.E.. era/-. In "Current Protocols in 
Molecular Biology". Green Publishing Associates & Wiley interscience, NY, 
pp. 9.1.1-9.1.9 (1991)). As a control of uansfccllon efficiency, ;r. several 
experimems 2 ^g of pCHl 10 plasmid (Pharmacia) was co-transfected and /3- 
D galactosidase activity measured in the cytoplasmic extracts. 
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RNA Blot-Hybridization Analysis. Total RNAs were prepared from 
cultured Cos7 cells (Chirgwin. J.M.. et aL, Biochemistry /5:5294-5299 
(1979)) and 2 samples elecirophoresed on a 1.1% agarose gel containing 
2.2 M formaldehyde. Transfer to GeneScreen Plus filters (New England 
5 Nuclear) and hybridization with a "P-labeled wt Epo probe were carried out 
as previously described (Faquin. W.C.. et al.. Blood 79:1987-1994 (1992)). 

Quantitation of Transiently-Expressed Recombinant Epos. The 
amount of secreted protein in the supematants of transfected Cos7 was 
determined by a radioimmune assay (RIA). The RIA was performed using a 
10 high titer rabbit polyclonal antiserum raised against the human wild type Epo 
and produced in our laboratory. ''-^I-labeled recombinant Epo was obtained 
from Amersham. Details of the protocol have been published elsewhere 
(Faquin, W.C., et al., Exp. Hematol. 21: 420-426 (1993)). 

Immunoprecipitation of "S-labeled Epo Proteins. Three days after 
15 transfection. the Cos7 monolayers were washed extensively with IX PBS and 
the cells incubated for 20 minutes at 37°C in 2 ml of MetVCys" Minimum 
Essential Media Eagle modified. In each culture dish, 100 n\ of TRAN35S- 
LABEL (["S]-cysteine and ["5]-methiomne, ~ 10 mCi/ml, ICN Biochemical) 
was then added. After two hours, the condiuoned media were harvested and 
20 cellular extracts were prepared by lysis in radioimmune precipitation buffer 
(50 mM Tris-HCl pH 8.0, 150 mM NaCl, 0.02% (w/v) sodium azide, 0.1 % 
(w/v) SDS, 0.5% (w/v) sodium deoxycholate, 1% (w/v) Triton X-100, 1 mM 
phenylmcthanesulfonyl fluoride and 1 ^g/ml aprotinin). Samples were 
precleared with rabbit preimmune serum/protein A-Scpharosc CL-4B 
25 (Pharmacia) for two hours. Immunoprecipitations were performed overnight 
with our polyclonal antibody specific for human recombinam wild type Epo 
and immunoadsorbed with protein A-Sepharose CL-4B. Immunoprecipitates 
were run on 15% SDS-polyacrylamide gels (Laemmii, U.K., Naiure 
227:680-685 (1970)) and analyzed by autoradiography after treatment with 
30 Enhance (New England Nuclear). 
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Bioassays. The dose-dependent proliferation activities of wild type Epo 
and Epo mutcins were assayed in vitro using three different target cells: 
murine spleen cells, following a modification of the method of Krystal 
(Krystal, Q.,Exp. Hematol. 11:649-60 (1983); Goldberg, M.A.. et ai, Proc. 
Nail. Acad. Sci. USA 84:7972-7976 (1987)); Epo re^nsivc murine 
erythroleukemia cell line, developed by Hankins (Hankins, W.D.. et aL. 
Blood 70-. 173a (1987)); and human Epo-dependent UT-7/^ceIl line, derived 
from the bone marrow of a patient with acute megalcaryoblastic leukemia 
(Komatsu, N., et aL. Cancer Res. 57:341-348 (1991)). After 22 to 72 hours 
of incubation with increasing amounts of recombinant proteins, cellular growth 
was determined by pH]-thymidinc (New England Nuclear) uptake or by the 
colorimetric MTT assay (Sigma) (Yoshimura, A., et ai. Proc. Natl. Acad. 
Sci. USA 57:4139-4143 (1990)). 

Bacterial Expression. The wild type Epo target, corresponding to the 
nucleotide sequence coding for the mature protein, was PCR-amplified using 
appropriate primers. In the sense primer an Ndel site (CATATG) was placed 
immediately 5' to the Alal codon of the mature protein. In the antisense 
primer a Bglll site was placed 3' to the TGA stop codon. After enzymatic 
digestion, the 516 bp PGR fragment was inserted in an Ndel/BamHI-cut 
pET16b plasmid (Novagen), which has a T7 promoter followed immediately 
by the lac operator. IPTG induction of transformed E. coli BL21 (DE3) (T7 
RNA polymerase*. Ion', ompT") resulted in high levels of expression of a 
fusion protein with a ten histidine stretch at the amino terminus. The oligo- 
His tag allowed the binding of the produced (HiSio)-Epo on a nickel affinity 
resin and its elution by increasing imidazole concentrations in presence of 
phenylmethylsulfonyl fluoride (Sigma). Most of the produced protein formed 
insoluble aggregates and was solubilized and affmity-purified under denaturing 



overnight dialysis at 4''C against 50 mM Tris-HCl pH 8.0, 40 /iM CuSO, and 
30 2% (wt/vol) Sarkosyl. Soluble protein was further dialyzed against 20 mM 
Tris-HCl pH 8.0. 100 mM NaCl. 2 mM CaCU and subjected to factor Xa 
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(New England Biolabs) cleavage to remove the NHj-polyHis sequence. 
Monitoring of the fusion protein following induction and during the various 
steps of purification was done by electrophoresis on a 15% pclyacrylamide- 
SDS gel, stained with Coomassie Brilliant Blue. Alternatively, the His-Epo 

5, fusion protein was detected on Western Blot (Gershoni, J.M., et ai, Anal. 
Biochem. 131:1-15 (1983)), using a 1/2000 dilution of our wt native Epo 
polyclonal antibody and a second biotinylated rabbit-q)ecific antibody which 
is detected with a streptavidin-alkaline phosphatase conjugate (Amersham). 
In Vitro Transcription/Transladon. Sense and antisense primers, 

10 creating new Bglll sites respectively 5' and 3' of the initiator and stop codons, 
were used in a polymerase chain reaction on pSG5-EpoAVT template. After 
Bglll cleavage, the 594 bp PCR-fragment was subcloned into p SP64T (Krieg, 
P. A., etal.. Nucl. Acid. Res. 12:lQSl-im (1984)). This SP6 containing 
vector provides 5' and 3' flanking regions from Xcnopus jS-globin mRNA, 

15 which allow efficient in wro transcription/translation. Previous experiments 
showed poor yields of in vitro translated protein, when using the GC-rich 
natural 5' Epo untranslated region (UTR). One step in vitro transcription/ 
translation was carried out by incubation of 1 /xg of circular p64T-Epo in a 50 
/il reaction volume of SP6-TnT coupled rabbit reticulocyte lysate system 

20 (Promega), in presence of p*5]-cysteine (1200 Ci/mM, New England 
Nuclear). In some cases, canine pancreatic microsomal membranes were 
added to the reaction mix. A purified GST-human Epo receptor extracellular 
domain fusion protein (EREx) was a gift from W. Harris and J. Winkelman, 
and the binding of the ^S-labeled translation products onto EREx-glutathione 

25 agarose beads were performed as described (Harris, K.W., et al, J. Biol. 
Chem. 267:15205-15209 (1992)). 
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Results 

Construction of a Model of the Three-Dimensional Structure of 
Erythropoietin 

From an analysis of the putative Epo helix sequences, aapatch 

5 identified eight possible helix-helix interaction sites. In principle, these sites 
could be used to generate 1.6 x 10* structures. Of these, only 706 maintained 
the connectivity of the chain and were sterically sensible. These structures 
resembled four helix bundles, an increasingly common motif in protein 
suiicture (Presneli, S.R., et al, Proc. Natl. Acad. ScL 86:6592-6596 (1989)). 

10 The strucnires that were not compatible with the native disulfide bridge 
between Cys, and Cys,4, were eliminated. This reduced the total number of 
structures from 706 to 184 (total computer time approximately 1 hr. on a 
Silicon Graphics IRIS 4D/35G). The remaining strucnires were then rank 
ordered by solvent-accessible surface contact area, a measure of the validity 

15 of model structures. The most compact structures were right handed, all anti- 
parallel four-helix bundles with no overhand connections, but this may be an 
artifact of a failure to add the polypeptide chain that forms the loops to the 
helical core constructed by aabuild. The other less compact structures were 
left-handed four-helix bundles with two overhand loops, a topology previously 

20 seen in the strucnires of IL-4 and growth hormone. We suspect that this is the 
likely structure for Epo. The consensus for assignments of putative a-helices 
in human Epo are summarized in Table III. First, analysis of the topological 
distribution of known four-helix bundle structures indicate that nearly all 
examples have an antiparallel orientation (Presneli, S.R., et al., Proc. Natl. 

25 /4cfl£f. 5c7. 56:6592-6596(1989)). Second, the left-handed four-helix bundles 
with two overhand connections arranges the tour amphipathic helices to form 
a compact hydrophobic core. Finally, a path for the loop regions of Epo can 
be imagined by analogy to IL-4 and growth hormone that preserves the 
compact globular nanjre of the Epo model strucwre. Figure 9 shows 
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schematic representations of the possible topological interactions between the 
four-anti-parallel a-helicai bundles. 



5 







Table III 




Predicted a-helicai regions or the mature erythropoietin protein 


Helix 


N-tcrminus 


C-terminus 


Potential Helix-Helix 
Interaction Sites 


A 


9 


22 


19 


B 


59 


76 


63. 67, 70. 71 


C 


90 


107 


95. 102 


D 


132 


152 


141 


Data were obtained using the various algorithms for secondary and 
tertiary structure generations described under Materials and Methods. 



Several authors have suggested that the helical cytokines form a 
structural superfamily (Cohen, F.E., etai, Science (1986); 
Parry. D.A.D.. et al., J. Mol. Recognition 7:107-110 (1988); Bazan, J.F., 

15 Biochem. Biophys. Res. Comm. 164:lii-19S (1989); Cosman, D.. et al., 
Trends Biochem. Sci. 75:265-270 (1990); Bazan, J.F., Proc. Nail. Acad. Sci. 
U.S.A. 87:6934-6938(1990)). On the basis of both the mature protein and the 
individual a-helices, Epo seems to be more closely related to growth hormone, 
prolactin, IL-6 and GM-CSF rather than the other members of the helical 

20 cytoldne superfamily. Nevertheless, recent improvements in algorithms for 
the identification of distant evolutionary relationships between proteins from 
structural fingerprints suggested that it might be possible to align the IL-4 
structure to the Epo sequences. The Eisenberg (Bowie, J.U., et al., 
Science 253:\(A-nO (1991)) structural environment and 3D-1D profile 

25 methods are a powerful tool for recoRnizine that a sequence is compatible with 
a known structure, e.g., a four-helix bundle. The NMR structure of IL-4 
from Smith, L.J., etai. J. Mol. Biol. 224:899-904 (1992) was used to 
construct a 3D- ID profile. A mixture of sequences including four helix 
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bundles, globins, and non-helical stnictures were aligned against the lL-4 
profile. Not surprisingly, the IL-4 structures from human and mouse gave the 
highest scores {V = 22.8 and 8.1). However, the other known four-helix 
bundle cytokines known to share a similar fold with IL-4, e.g.. human growth 

5 hormone (Abdel-Meguid. S.S.. et al. Proc. Natl. Acad. ScL 5^:6434-6437 
(1987)) (Z = 2.3) and GM-CSF (Diederichs, K., et al.. Science 
25-^:1779-1782 (1991)) (Z = 2.3) fared no better than some globin sequences 
(Kuroda's and slug sea hare globin, Z = 5.0 and 4.8) that adopt a distinct 
tertiary structure. The results for the human and sheep Epo sequences were 

10 also ambiguous (Z = 1 .6 and 0.8). These results suggest that while profile 
methods are a powerful tool for recognizing structural similarity, their failure 
to identify homology does not exclude the possibility that two proteins share 
a common fold. For distantly related or unrelated structures, current profile 
methods cannot replace de novo methods for tertiary structure prediction. 

15 Design and Expression of Epo Muteins that Test the Proposed Structure 

To test the proposed four o-helical bundle structure of erythropoietin 
and at the same time to attempt to locate functional domains, we created by 
site-directed mutagenesis a series of deletion, insertion and replacement 
miitants. These muteins were designed to analyze the principal predicted 
20 structural features of the molecule: a-helices, interconnecting loops, as well 
as the NH, and COOH termini. Structural and functional implications of the 
disulfide bridges and the glycosylation sites were also investigated. 

a Helices. Short amino acid deletions were prepared in, or close to, 
the predicted A. B, C and D a helices. Human wild type and muteins were 



25 ^Z-scores are used to describe the normalized weight associated with a 

profile score. A distribution is built from a collection of sequences with a 
mean 2 score of 0.0 and a standard deviation of 1.0. Z-scorcs greater than 
6.0 are associated with significant alignments. Z-scorcs between 3.0 and 6.0 
may or may not be structurally relevant. 
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transiently expressed in Cos? cells. Northern blot analyses demonstrated that 
all the mutant plasmids produced about the same amount of mRNA as that of 
the wild type {data not shown). Yet, no detectable amount of Epo protein 
could be found in the Cos? supematants, either by radioimmunoassay or by 
5 bioassay using various Epo-dependent cell lines. Table IV summarizes these 
findings. 

TABLE IV 

Shon deletions in, or close to, a helices 

mutants. 
A 12-16 



A 65-69 



A 96-100 
A 105-109 



A 122-126 

A 131-135 
A 140-144 
A 142-150 
A 152-155 

A 156-160 

Predicted N and C termini of each a helix are indicated in the vertical boxes. 

An example of SDS-PAGE of immunoprecipitants from in vivo ^'S- 
labeling is presented in Figure 10. As expected, when Cos? cells were 
transfeaed with pSG5-EpoAVT. a 35-3? KD band was detected in the 
supernatant. In contrast, the deletion mutants (Table IV) could be detected in 
5 cellular extracts but were not exported from the cells. Figure 10 shows the 



90 
i 

107 



- RNA levels comparable to WT. 

- no detectable Epo in the Cos? 
supernatant, both by RIA and 
bioassay. 



132 
152 
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cytoplasmic retention of the mutein A140-144, lacking four residues in the 
middle of the predicted D helix. The apparent molecular weight (around 28 
IcD) is less than expected for a five-amino acid deletion. Therefore not only 
the secrcuon, but also the glycosylation, seem to be impaired. None of these 
muteins had deletion of glycosylation sites. It is likely that full glycosylation 
of Epo requires conservation of its molecular architecture. Similar results 
(reported in Table IV) were obtained for all the muteins having partial deletion 
of an a helical peptide segment. 

Because contaminants in crude Cos? cellular extracts severely interfere 
with the radioimmunoassay, no direct Epo quantitation was possible. 
However, aliquots of hypotonic extracts of Cos? transfected with wild type 
Epo were able to sustain HDC57 proliferation. No similar biological activity 
was found for muteins with limited deletion of a helices. 

Interconnecting Loops. The peptide segment joining A and B helices 
presents several interesting features (Figure 11). AB loop consists of 36 
amino acids. Two N-glycosylation sites and a small disulfide bridge are 
located in the first half and their biological implicauons will be discussed later. 
The COOH end of the AB loop contains a stretch of amino acids that is 
strongly conserved among mammals (See Example I). Alignments of hunaan, 
monkeys, cat. mouse, rat. pig. and sheep Epos showed a consensus sequence: 
DTKVNFYAWKR(M/I)(E/D)VG (residues 43 to 5?) [SEQ. ID NO: 4). 
Three deletions were constructed: A43^?, A48-52. and A53-5?, and 
transiently expressed in Cos?. The amount of muteins detected by RIA in the 
supcmatants of transfected cells was 10 to 40% lower than observed with wild 
type Epo (Figure llA). Nevertheless, the three secreted muteins were 
biologically active. However, because M8-52 exhibited a marked decrease 
of the specific bioactiviiy, this site was studied in more detail by means of 
serine replacements. Krystai ex vivo bioassay as well aS KCD57 and 
UT-?/Epo in vitro bioassays showed that these Ser mutants had biological 
activiues similar to that of wild type (Figure IIB). Therefore, the observed 
decreases in both RIA and bioassay for the three deletion mutants are likely 
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10 be the result of changes of structural conformation. The long length of loop 
AB may be critical for the up-up-down-down topognqjhy. A shorter AB 
segment may impose a strain on the interhelical connection. Chou and 
Fasman algorithms (Chou, P.Y., et ai, Ann. Rev. Biochem. 47:251-276 

5 (1978)) predicted a short jS-sheet structure from residues 44 to 51 (<Pa> ; 
<Pb>, 1.005 < 1.196). The presence of a short region of /3-sheet in the 
connection between helices 1 and 2 (A and B) have been documented in the 
analyses of the three-dimensional structures of lL-4 (Redfield, C, el al.. 
Biochemistry 50:11029-11045 (1991); Powers, R.. etal., Science 256:1612- 

10 1677 (1992)), GM-CSF (Diederichs, K., etal.. Science 254:1779-1782 

(1991) ), and monomeric M-CSF (Pandit. J., et al.. Science 258:1358-1362 

(1992) ). In contrast, in human GH a short segment of a helix is found at the 
same location (Abdel-Meguid. S.S., etal.. Proc. Natl. Acad. So. USA 
54:6434-6437 (1987)). The structure/function implications of these short 

15 features are not yet understood. 

Helix B is linked to helix C by a much shorter segment (residues 77 
to 89) and contains in its center the third N-glycosylaiion site (Asn83). When 
the A78-82 mutein was expressed, a secreted protein was detected in the 
conditioned medium and conferred proliferative bioactivity on Epo-dependent 

20 cell lines (see Figure 16). 

A similar long crossover connection (23 amino acids) is found between 
helix C and helix D. In contrast to what we previously observed for loop AB, 
a large deletion of nine residues at position 111-119 or a seven amino acid 
insertion of a myc epitope after residue 116 did not affect the secretion of 

25 these muteins (Figure 12). Furthermore, these two proteins had normal 
specific activity, as seen by the ratio of bioassay to RIA. Our rabbit 
polyclonal antibody raised against the native form of the human wild type fully 
recognized ihe two mumiiis, ucuiuiisiiaiirig that the overall spaual 
conformation of Epo was well preserved. According to the algorithm of 

30 Emini, E.A., et al.. J. Virol. 55:836-839 (1985), the residues 111-119 are 
predicted to be at the surface of the molecule. Primary amino acid alignments 
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of mammalian Epo showed a large variation in ihc sequence of residues 1 16 
to 130, including amino acid deletion, insertion, and substitution (See 
Example I). Surprisingly, when the deletion A122-126 mutein, which 
removed the 0-glycosylation site (Ser 126), was transiently expressed in 

5 monkey cells, protein secretion was inhibited. Both rodents, rat and mouse, 
lack the 0-giycosylation site because of a Serl26 to Pro replacement. 
Furthermore, when a Serl26 replacement mutein was expressed in normal 
CHO cells. (DeLorme, E.. et al., Biochemistry 557:9871-9876 (1992)) or 
when wild type Epo was expressed in cells having a defect in O-linked 

10 glycosylation (Wasley, L.C., ei al.. Blood 77:2624-2632 (1991)), neither 
secretion nor biological activity were impaired. Therefore, failure of secretion 
of the A122-126 mutein may be the result of some other structural alteration. 
In particular, the proline residue at position 122 is invariant among mammals. 
NandC Termini. Deletion of residues 2 to 6 only slightly affected 

15 the processing of a biologically active protein (see Figure 16). This deletion 
may impair cleavage of the propeptide, therefore explaining the lower yield 
of secreted Epo mutein in comparison to that of wild type. The fact that the 
mawre monkey protein has an elongated (Val-Pro-Gly) N-terminus strongly 
suggests that the N-terminal part is not involved in the bioactivity of the 

20 molecule. Further evidence comes from the results, reported below, on the 
N-poly-His-Epo fusion protein expressed in E. coli, and also from the identical 
binding of /« vitro translated "S-labeled wild type Epo onto EREX-glutathione 
agarose beads (Figure 13), with or without addition of canine pancreatic 
microsomal membranes which permit cleavage of the propeptide.' 

25 The C-terminal sequence following helix D can clearly be divided into 

two distinct domains, separated by Cysl61. The residues 151 to 161 were of 
special interest because they are highly conserved among mammals (See 
c»..j»,«i» l^ There are en!" two substitutions: Lvsl6^ rp"!?c^<? bv a Thr in 



'All the mutants described in this example were subcloned into pSPG4T 
30 plasmid. 
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articxlactyls and cat. and Alal60 is replaced by a Val in mouse Epo. Both the 
A152-155 and the A156-160 muteins remained in the cytosol of the transfected 
Cos7 (Table IV). One possible explanation is that the residues 152 to 160 
may, in fact, participate in the D helix. We predict that GlylSl is the break 

5 point of the structure. However, it is possible that this residue causes only a 
bend in the a helical structure and helix D may extctxl to Glyl58. 

The C-terminal pan of the protein (residues 162 to 166) is clearly not 
involved in any structural or functional feature. Thus, the deletion of the four 
last amino acids or the replacement of residues 162-166 by cither a KDEL 

10 sequence or a poly-histidine sequence* did not modify the specific activity of 
the erythropoietin (Figure 14). Radioimmunoassay revealed that the secretion 
of the KDEL-tail mutcin in the media of transfected cells was 4596 less than 
normally obtained with the wild type Ejjo. However, when compared to the 
wild type, this mutein had more biological activity in the hypotonic Cos? cell 

1 5 extracts. The KDEL COOH-terminal sexjuence has been shown to be essential 
for the retention of several proteins in the lumen of the endoplasmic reticulum 
(Andres, D.A., etal., J. Biol. Chem. 266:14288-14282 (1991)). 
Nevertheless, because of overproduction in transiently expressed cells, a large 
percentage of recombinant protein escaped into the media. 

20 Disulfide Bridges. Wang et al. demonstrated that the biological 

activity of Epo was lost irreversibly if the sulfhydryl groups were alkylated 
(Wang, F.F., et al., Endocrinology 116:22S6-2292 (1985)). The mature 
human Epo has two internal disulfide bonds: Cys7=Cysl61, linking the NH, 
and COOH termini of the protein and a small bridge between Cys29 and 

25 Cys33. Cys33 was previously changed to Pro by site-directed mutagenesis, 
and the resulting protein was reported to have greatly reduced in vitro 
biological activity (Lin. F.-K.. Molecular and Cellular Aspects of 



*Thc poly-His tail wild type mutant was purified by means of nickel 
affinity chromatography which will enable quantitation of cytosolic-retained 
30 mutants, The (His)^ C(X)H-terminal sequence has been appended to all the 
muteins described in this example. 
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Erythropoietin and Erythropoiesis H8:23-36, NATO ASl Senes, Springer- 
Verlag, Berlin Heidelberg (1987)). However, rat and mouse Epos have the 
same substitution and yet exhibit full cross-species bioactivity. To resolve the 
role of the small disulfide bridge in human Epo function, we created a 

5 C29Y/C33Y double mutation. The resulting mutein was normally processed 
and showed the same in vitro bioactivity as the wild type (Figure 11 A).* 
Furthermore, the deletion of five amino acid residues (A32-36) did not impair 
the secretion of a biologically active mutein. These data demonstrated that only 
the native and fully conserved disulfide bridge Cys7=Cysl61 is crucial for the 

10 preservation of the molecular structure of erythrc^ietin. 

Functional Role of the Glycosyladon. Natural or recombinant human 
Epo is a heavily glycosylated protein; 40% of its molecular weight is sugars 
(Davis, J.M.. etal, Biochemistry 26:2633-2638 (1987)). The protein has 
three N-linked oligosaccharide chains, located at amino acid positions 24 

15 and 38 (in predicted loop AB) and position 83 (in loop BC). It has one 0- 
linked carbohydrate chain at position 126 (in loop CD), which is missing in 
rodents. The role of these sugar chains in the biological activity of the human 
hormone has been extensively studied. Site-directed mutagenesis at die N- 
glycosylation sites demonstrated that even though the sugars were important 

20 for proper biosynthesis and secretion, their removal did not affect in vitro 
activity. This finding was corroborated by several investigators (Yamaguchi, 
K., et al, J. Biol. Chem. 266:20434-20439 (1991); Dub6, S., et al, J. Biol. 
Chem. 263:17516-17526 (1988)). However. Takeuchi, M.. etal.. J. Biol. 
Chem. 265:12127-12130 (1990) found that N-glycanase digestion results in 

25 almost complete loss of biological activity. In contrast, there is general 
consensus that glycosylation plays a key role in the biological activity of the 
hormone in vivo. Various reports have demonstrated that the N-linked sugar 
chain; enhance the stabiiit" and sur.'iva! of E"o ir^ the blood stream (F'jkuda, 



^wo single replacement muteins (C29Y and C33Y) were also stable and 
30 had full biological activity (results not shown). 
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M.N., eial.. Blood 75:84-89 (1989); Tsuda. E., et ai. Eur. J. Biochem. 
iS8:405-411 (1990)) and protect the hormone against clearance by the liver 
(Sasaki, H., et ai, J. Biol. Chem. 262: 12059-12076 (1987)). thereby enabling 
the transit of the hormone from its site of production in the kidney to its target 
5 cells in the bone marrow (Spivak. J.L., et ai. Blood 75:90-99 (1989)). 

We expressed the wild type Epo in E. coli. Accordingly, the produced 
protein completely lacks sugar. The pET expression system was used and is 
detailed in the Methods section. IPTG induction of transformed 8L21 (DE3) 
bacterial strain rapidly results in a high level of expression of the poly-His Epo 
10 fusion protein (Figure 15A and 15B). After three hours of induction, we 
obtained a typical yield of - 1 mg of total protein per ml of culture. 
However, the vast majority of the expressed protein was present in the 
inclusion bodies and therefore its solubilization and purification on the nickel 
beads was performed in 6 M guanidine-HCl. Oxidative refolding and factor 
15 Xa cleavage resulted in soluble forms (Figure 15C) and the in vitro biological 
activity was tested on HDC57 cells. The cleaved E. coli recombinant Epo 
showed a notable decrease of the specific activity (10% less than the fully 
glycosylated mammalian expressed protein), but was still able to maintain 
HCD57 proliferation. The observed reduction of in vitro activity is likely to 
20 be due to improper refolding of the insoluble protein and impaired physical 
stability of the E. coli Epo as previously reported (Narhi. L.O., et ai, J. Biol. 
Chem. 266:23022-23026 (1991)). However, the fact that the E. coli Epo was 
still able to trigger HCD57 growth indicated an overall preservation of the 
molecular structure. The uncleaved fusion protein, with 10 His residues at the 
25 N-terminus, exhibited a 67 % loss in biological activity when compared to the 
cleaved protein. Thus, the addition of a 20-residue sequence to the N- 
terminus partially inhibited the biological activity. 
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Discussion 



Currently the accrual rate of new protein sequences through gene 
cloning far outstnps the rate of determination of three dimensional structure. 
Epo is among a large number of biologically important proteins which have 
not yet been analyzed by X-ray diffraction or NMR. The problem is simplified 
by cumulative evidence that the structures of most proteins arc likely to be 
variations on existing themes (Levitt. M.,etal., Nature 261:552-55% (1976)). 
Indeed, as mentioned above. Epo appears to share common structural features 
with a large group of cytokines (Parry. D.A.D., et aL. J. Mol. 
Recognition i:107-110 (1988); Bazan, F., Immunology Today 77:350-354 
(1990); Manavalan, P., et al., J. Protein Chem. 77:321-331 (1992)). 

Computer-based prcdicuon of structure can be reduced to a three stage 
process: secondary structure is predicted from the primary amino acid 
sequence and, when available, optical measurements. Analysis of Epo by 
circular dichroism reveals about 50% a helix and no detectable ^ sheet (Lai, 
P.-H.. etal., J. Biol. Chem. 267:3116-3121 (1986); Davis, J.M., et al, 
Biochemistry 26:2633-2638 (1987)). With the knowledge of disulfide bonds, 
secondary structural elements are then packed into a set of alternative tertiary 
structures. The number of plausible anangcmcnts can be reduced by 
empirical knowledge of preferred helix-helix packing geometries and the need 
for globular structure to form a hydrophobic core. The putative tertiary 
structure is then refined by standard force field calculations. Since there are 
a large number of alternate tertiary structures, the availability of 
experimentally determined structure of a homologous protein is critically 
important. Thus the predicted model of Epo structure gains considerable 
validity by knowledge of the structures of GH (Abdcl-Meguid, S.S., et al, 
Proc. Nail Acad. Aa. t/M «4:0434-b437 (1987); deVos, a.m., etal, 
Science 255:306-312 (1992)) and lU (Redfield, C, etal. 
Biochemistry 30: \\029-nOA5 (1991); Powers, R., etal. 
Science 256:1673-1677 (1992)). 
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We have tested the predicted four anti-parallel a-helical bundle 
structure by means of site-directed mutagenesis. Deletions within predicted 
a helices would be expected to destabilize tertiary structure, whereas deletions 
or insertions in non-helical segments should be permitted unless they impose 

5 undue strain on the structure. For example, a deletion in an overhand inter- 
helical loop may result in insufficient length to connect the two helices. 
Results that we have obtained on muteins produced in mammalian (Cos!) cells 
are summarized in Figure 16. Our measurements of the quantities of 
processed mutein by RIA may underestimate the true amount of secreted Epo. 

10 Even a small deletion or insertion can result in a conformational change that 
may lead to impaired binding by our polyclonal antibody, raised against native 
human Epo. Thus the values for specific activity (biologic activity/RIA) that 
we report must be regarded as approximations. This caveat notwithstanding, 
our mutagenesis results are in good agreement with our proposed four 

15 a-helical model of erythropoietin. The proper folding of Epo into its native 
tertiary structure is necessary for stability and biological function. Muteins 
with shon deletions inside predicted a-helices were not processed and exhibited 
no biological activity. In contrast, when deletions were created in predicted 
interconnecting loops, secreted proteins were detected, to varying degrees, 

20 botii by radioimmunoassay and bioassay. Furthermore, additions or deletions 
at the N- or C-termini did not markedly impair the secretion and the biological 
activity of the Epo protein. Moreover, mutations at Cys29 and Cys33 showed 
that the small disulfide loop is not critical for biological activity. In order to 
delineate Epo's functionally important residues involved in the direct binding 

25 onto the Epo receptor, we have prepared and tested a series of amino acid 
replacements on the surfaces of the predicted a helices. These experiments 
are described in a Example III. 
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EXAMPLE III 



Erythropoietin Structure-Function Relationships: 
Muteins That Test Functionally ImporUnt Domains 



Introduction 



5 The Epo receptor is a member of an ever enlarging family of cytokine 

receptors (Bazan, P., Biochem. Biophys. Res. Commun. 164-J&i-1% (1989)). 
In lilce manner, Epo shares wcalc sequence homology with other members of 
a family of cytokines which also include growth hormone, prolactin, IL-2, 
IL-3, 1^4. IL-5. IL-6. IL-7, G-CSF. GM-CSF, M-CSF, oncostatin M, 

10 leukemia inhibitory factor and ciliary neurotrophic factor (Parry, D.A.D. 
etaL, J. Mol. Recognition i:107-110 (1988); Bazan F., Immunology 
Today i7:350-354 (1990); Manavalan, P. et al.J. Protein Chem. iJ:321-331 
(1992)). The genes encoding these proteins have similar numbers of exons as 
well as a clear relationship between intron-exon boundaries and predicted a- 

15 helical bundles. Such theoretical models of the strucnires of human growth 
hormone (Cohen. F.E. etaL, Proteins: Struct. Func. Genet. 2:162-166 
(1987)) and IL-4 (Curtis. B.M. et al.. Proteins: Struct. Func. Genet. 
i i : 1 1 1 - 1 1 9 { 1 99 1 )) have been i n remarkably good agreement with subsequent 
structures established by X-ray diffraction (HGH) (Abdel-Mcguid. S.S. et al., 

20 Proc. Natl. Acad. Sci. USA 54:6434-6437 (1987); de Vos, A.M. et al.. 
Science 255:306-312 (1992)) or by multidimensional NMR (1L4) (Redfield, 
C. etal.. Biochemistry 30:11029-11035 (1991); Powers. R. etaL, 
Science 256: 1673-1677 (1992)). Moreover, the crystal structures of GM-CSF 
(Diederichs. K. et aL, Science 254:\m-m2 (1991))and monomeric M-CSF 

25 (Pandit, J. etaL, Science 25S: 1358-1362 (1992)) are also in reasonable 
uoTrrmfm with rhftir prwlirted srnicnire<; 

The structure of Epo has not yet been analyzed by either X-ray 
diffraction or by NMR. In order to begin to gain an understanding of 
structure-function relationships, we have taken a three-pronged approach: 
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a) 



Sequence dettrminacion of Epo from mammals of differeni 
orders in order to establish regions of homology. This work is 
described in Example 1. 



b) 



Construction of a model of the three-dimensional structure of 



5 



c) 



Epo, followed by the design and preparation of muteins that test 
this model. These experiments are presented in Example II. 
Design and testing of muteins that provide information on 
receptor binding domain(s). This work is presented in this 
Example. 



10 



MATERIALS AND METHODS 



Construction of the Mutagenic/Mammalian Expression Plasmid. A 
M13 plasmid, containing a 1.4 Kb Ecd^-EcdK\ human Epo cDNA insert 
(XHEpo FL12) was a gift from Genetics Institute (Cambridge, MA) (Jacobs, 
K. et ai, mure 575:806-810 (1985)). A 943 bp EcoRl-Bglll fragment, 

15 corresponding to the complete coding sequence of the wild-type human 
erythropoietin, including untranslated regions 216 bp upstream and 183 bp 
downstream, was inserted into the mammalian expression plasmid pSG5 
(Stratagene ) (Sambrook, J. ei aL, In "Molecular Cloning: A Laboratory 
manual", 2nd ed. Cold Spring Harbor, Laboratory, Cold Spring Harbor, 

20 N.Y., vol. 1, pp.1.38-1.41 (1989)) and named pSG5-EpoAVT. 

Site-Directed Mutagenesis was carried out according to the protocol 
described by Kunkel, T.A. etal., Meth. Enzymol. 754:367-382 (1987). 
Single-stranded DNA was rescued from the pSG5-EpoAVT phagemid grown 
overnight inf. coli CJ236, in 2XYT media containing M13K07 helper phage 

25 (In Vitrogen) and 70 ng/m\ kanamacyn (Sigma, St. Louis, Mo). The resulting 
urscii-coiitaiTung ssDNA 'was used as a tcrnpiatc for niutagcricSiS. 
Oligonucleotides (24 to 46 mer) were synthesized with their 5' and 3' ends 
complementary to the target wild-type Epo sequence. A large variety of 
mutations (base substitutions, deletions and insenions) were created at the 
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centers of the mutagenic primer sequences. Annealing of the phosphorylated 
primers (10:1 oligonucleotide/DNA template molecular ratio) was performed 
in 10 ti\ of a 20 mM Tris-HCL pH 7.4, 2 mM MgClj, 50 mM NaCl solution. 
The reactions were incubated at 80°C for 5 minutes, then allowed lo cool 
5 slowly to room temperature over a one hour period. The DNA polymerization 
was initiated by the addition as a mix of 1 /i' of lOX synthesis buffer (100 mM 
Tris-HCl pH 7.4 50. mM MgClj, 10 mM ATP, 5 mM each dNTPs. 20 mM 
DTD, 0.5 ^L\ (8 units) of T4 DNA ligase and 1 n\ (1 unit) of T4 DNA 
polymerase (Boehringer Mannheim). After 2 hours at 3TC, 80 /il of IX TE 
10 was added. 5 ti\ of the diluted reaction mix was used to transform competent 
E. coli NM522 (ung*. dui*). 

Since a 40-80% mutation yield is normally obtained, 4 to 5 double- 
stranded plasmid clones from each reaction were sequenced with 7-deaza- 
dGTP and Sequenase (U.S. Biochemical) (Sanger. F. et al.,Proc. Nail. Acad. 
1 5 Sci. USA 74:5463-5467 ( 1 977)). As a rule, the entire coding sequences of the 
Epo mutants were examined for the presence of unwanted mutation by 
sequencing or restriction enzyme mapping. 

Production of Wild-Type and Epo Muteins in Mammalian Cells. 
Cos7 cells grown to approx. 70% confluency were transfected with 10 /xg 
20 recombinant plasmid DNA per 10 cm dish using the calcium phosphate 
precipitation protocol (Kingston, R.E. etal.. In "Current Protocols in 
Molecular Biology". Green Publ. Assoc. & Wiley Interscience, N.Y.. vol. 1, 
pp. 9.1.1-9.1.9 (1991))- As a control of iransfection efficiency, in several 
experiments 2 fig of pCHllO plasmid (Pharmacia) was co-transfected and ^- 
25 galactosidase activity measured in the cytoplasmic extracts. 

RNA Blot-Hybridimion Analysis. Total RNAs were prepared from 
cultured Cos7 cells (Chirgwin, J.M. et si.. Biochemistry 78:5294-5299 (1979)) 
and 2 /ig samples electrophorescd on a agarose gel wjiiUiiuiiig 2.2 M 
formaldehyde. Transfer to GeneScreen Plus filters (New England Nuclear) 
30 and hybridization with a "P-labeled wt Epo probe were carried out as 
previously described (Faquin, W.C. et al.. Blood 79:1987-1994 (1992)). 
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Quantitation of Transiently-Expressed Recombinant Epos. The 
amount of secreted protein in the supematants of transfected Cos? was 
determined by a radioimmune assay (RIA). The RIA was performed using a 
high liter rabbit polyclonal antiserum raised against the human wild-type Epo 

5 and produced in our laboratory. '"I-labeled recombinant Epo was obtained 
from Amersham. Details of the protocol have been published elsewhere 
(Faquin, W.C. et ai, Exp. Hematol. 21: 420-426 (1993)). 

Bioassays. The dose-dependent proliferation activities of wt and Epo 
muteins were assayed in vitro using three different target cells: murine spleen 

10 cells, following a modification of the method of Krystal (Krystal, G., Exp. 
Hematol. 77:649-660 (1983); Goldberg, M.A. et al.. Proc. Natl. Acad. Sd. 
USA 84:1912-1916 (1987)); murine Epo re^nsive murine cell line, 
developed by Hankins (Hankins, W.D. et ai. Blood 70:173a (1987)); and 
human Epo-depcndent UT-7/Epo cell line, derived from the bone marrow of 

15 a patient with acute megakaryoblastic leukemia (Komaisu, N et al., Cancer 
Res. 57:341-348 (1991)). After 22 to 72 hours of incubation with increasing 
amounts of recombinant proteins, cellular growth was determined by pH]- 
thymidine (New England Nuclear) uptake or by the colorimetric MTT assay 
(Sigma) (Yoshimura, A. et al., Proc. natl. Acad. Sci. USA 87:4139-4143 

20 (1990)). 

RESULTS AND DISCUSSION 

Our objective is to identify the functionally imporUnt domains on the 
Epo molecule by preparing muteins that have altered specific activity, i.e. 
significantly high or low biological activity per unit mass of protein. Such 
25 muteins must satisfy the following criteria: efficient secretion by Cos7 cells; 
luli recogniiion (near iiorniai uiiiuiiig anmay) by uit polyCiOual anii-EpO 
antiserum that we use in our radio-immune assay; preservation of the overall 
folding and three dimensional structure. In order to meet these criteria, the 
muteins that we employ in this study have only single amino acid 
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replacements. A comprehensive amino acid replacement scan of Epo would 
of necessity require the preparation and testing of a minimum of 166 muteins. 
Such an undertaking would be prohibitively labor intensive and costly. 

In order to devise a rational strategy to reduce the number of required 
5 replacements, we have taken advantage of information we have obtained 
employing scanning deletion muteins (Sec Example II). On the basis of its 
primary amino acid sequence and disulfide bonds Epo, in common with other 
members of the cytokine family is predicted to have a four antiparallel 
amphipathic a-helical bundle structure. We have shown that deletions in non- 
10 helical regions at the N-terminus, the C-terminus and in loops between helices 
result in the formation of protein that is readily secreted from the cell and is 
biologically active. These regions can be ruled out as functionally important 
domains such as the sites involved in the binding of Epo to its receptor. These 
results suggest that the functionally important contact sites are likely to reside 
15 on the predicted a helices. Furthermore, since the amphipathic helices are 
predicted to bind to one another along their hydrophobic surfaces, the 
biologically relevant contact sites are likely to be residues predicted to be on 
the external surfaces of the helices. These considerations greatly restrict the 
number of mutations needed to provide a comprehensive and informative body 
20 of data. 

Because structure-function studies on growth hormone and IL-4 have 
indicated that Helices A and D are involved in receptor binding, most of our 
muteins have been directed to these regions. 

Results on Helix A muteins are shown in Figure 17. The top panel 

25 shows a helical wheel projection. Replacement by alanine of the predicted 
external residues 13, 18, 20 and 21 have no significant impact on biological 
activity as determined by the relative incorporation of ['H]-thymidine into Epo 
dependent HCJD57 cells. However, the replacement of Argi4 by Aia showed 
markedly impaired biological activity. 

30 Figure 18, top panel, shows a helical wheel projection for Helix D. 

Nine of the nine predicted external residues were replaced by alanine. 
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Mutations at positions 136, 139 and 140 gave rise to proteins that were 
efficiently secreted and had full biological activity (Figure 18, bonom). in 
contrast Ala replacement of either Lysl40, Argl43, Serl46, Asnl47 or 
Lysl54 resulted in muteins with significantly (3-fold) increased biological 
5 activity. Replacement of Asnl47 with Ala resulted in the greatest increase in 
biological activity. (Figures 23-25). 

Because the C-terminal boundary of Helix D is uncertain, we tested 
alanine replacements at positions 152, 154 and 156 which are predicted to be 
in the adjacent non-helical segment at the C-terminus. As shown in Figure 19, 
10 the Ala replacement at Lysl54 had slightly increased biological activity while 
the Ala replacement at Tyrl56 had slightly decreased biological activity. 

The experiments reported above all involve testing the function of 
human Epo muteins on the mouse Epo responsive HCD57 cell line. Figure 20 
shows a comparative display of those muteins that appear to have higher 
15 biological activity than wild-type Epo. 

In order to assess the significance of these results we have also tested 
these muteins in a more physiologic system, mouse erythroid progenitors 
prepared from the spleens of animals challenged with phenylhydrazine. This 
bioassay obviates the possible confounding effecu of a malignant clonal cell 
20 line such as HCD57 which was derived from mouse crythroleukemia virus. 

As shown in Figure 21, results with erythroid cells from mouse spleen 
were very similar to those obtained with HCD57 cells. 

In addition we have tested these muteins in an Epo responsive human 
cell line UT-7/Epo, derived from a patient with acute leukemia. This cell line 
25 provides the opportunity of examining the interaction of human muteins with 
the human Epo receptor. As shown of ordinary in Figure 22. these results 
closely parallel those with the two mouse cell bioassays. 

li shOiiiu bt TiOtcu that in Several Of the uOSc fcSpOiiSc buivc^, at lllC 

highest concentrations of Epo tested, there is a significant drop-off in the 
30 incorporation of ['H]-thymidine. This is due to the fact that high levels of 
biologically active Epo causes terminal maturation and cessation of division. 
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Because of this possibly confounding phenomenon, it is important to have, as 
in the experiments that we report here, a full dose response curve for each 
mutein with an adequate number of data points. 

Taken together our results show that a restricted number of residues on 

5 Helix A and on Helix D are involved in Epo's biological activity. Further 
experiments are in progress to determine if the observed alterations in 
biological activity can be explained by parallel changes in the binding affinity 
to the Epo receptor. In addition, at the sites that we have identified as 
involved in Epo's biological activity, we plan to make a set of additional 

10 replacement muteins in order to further refine the ligand-receptor interaction. 
We are particularly interested in developing muteins with maximally increased 
biological activity, e.g. by preparing compound or multiplex replacements, 
each of which confers optimal enhancement of biological activity. Such a 
"superEpo" could have improved utility in the therapy of anemias. 

15 Conclusions 

We have produced a number of erythropoietin muteins comprising 
native erythropoietin with one or more modifications which improves the 
biological activity. 

All references mentioned herein are incorporated by reference into this 
20 disclosure. 

Having now fiilly described the invention by way of illustration and 
example for purposes of clarity and understanding, it will be j^parent to those 
of ordinary skill in the an that certain changes and modifications may be 
practiced within the scope of the invention, as limited only by the following 
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SEOUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Brigham and Women's Hospital 
75 Francis Street 
Boston, MA 02115 

INVENTORS: Boissel, Jean-Paul R. 

Burm, H. Franklin 
Wen, Danyi 
ShowerE, Mark 0. 

(ii) TITLE OF INVENTION: Erythropoietin Muteino With Enhanced 
Activity 

(iii) NUMBER OF SEQUENCBS: 59 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sterne, Kessler, Goldstein k Pox 

(B) STREET: 1100 New York Avenue, Suite 600 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: U.S.A. 

(F) ZIP: 20005-3934 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version Sl.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: (to be assigned) 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/049,802 
<B) FILING DATE: 21-APR-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

• (A) NAME: Cirabala, Michele A. 

(B) REGISTRATION NUMBER: 33,851 

(C) REFERENCE/DOCKET NUMBER; 0627.336PC01 

(ix)' TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 371-2600 

(B) TELEFAX: (202) 371-2540 



(2) INFORMATION FOR SEC ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
ATWGAATGAA SGC 

(2) INFORMATION FOR SEQ ID NO : 2 : 



13 
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(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:; 
AWGGTNSGGG 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
TGCTTCTGCT ATCTTTGCTG CTGC 24 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; both 



(A) NAME/KEY: Peptide 

(B) LOCATION: 12. .13 

(D) OTHER INFORMATION: /label= Peptide 

/note= "The amino acid at position 12 may also be 
ieoleucine. The amino acid at position 13 may 
also be aspartic acid." 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Asp Thr Lye Val Asn Phe Tyr Ala Trp Lys Arg Met Glu Val Gly 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TGAAGTTTGG CCGAGAAGTG GATGC 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
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(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
AAGAKGTACC TCTCCAGRAC TCG 
(2) INFORMATION FOR SEO ID NO : 7 ; 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
CTGCTCCACT CCGAACAKTC AC 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTGGAGTGTC CATGGGACAG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
AGGCGCGGAG ATGGGGGTGC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(XI ) SEQUENCE DESCRIPTION: SEQ ID NO : 1 0 : 
TGGTAGCTGG GGGTGGGGTG TGCACACGGC AGCAGGATTG AATGAAGGCC AGGGAGGCAG 
CACCTGAGTG CTTGCATGGT TGGGGACAGG AAGGACGAGC TGGGGCAGAG ACGTGGGATG 
AAGGAAGCTG TCCTTCCACA GGCCACCCTT CTCCCTCCCC GCCTGACTCT CAGCCTGGCT 
ATCTGTTC7A GAATGTCCTG CCTGGCTGTG GCTTCTCCTG TCCCTGCTGT CGCTCCCTCT 
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GGGCCTCCCA GTCCTGGGCG CCCCACCACG CCTCATCTGT GACAGC 
(2) INFORMATION FOR SEQ ID NO; 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 0 base paire 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO ill: 
CTCGTAGTTG TTGGGTGGCG TGTGCACGCG GTCAGTGTAT TGAXTGAAGG CAAAGGAGGC 
AGAGCCTGCG CGCTCGCAAG GTTGGGGTCG GAAACGGCTG GCTGGGGGCA AGACGGCCGG 
ATGGGTGAAC CTGTCCGTCT AAACCAACCC TTCTCCTGCA ATGCCCAGCC TCACATTCAG 
CCTGGCTCTC TTTCCTAGAA TGTCCTGCCC GGCTGCTTCT GCTATCTTTG CTGCTGCTTC 
CTCTGGGCCT CCCAGTCCTG GGCGCCCCCC CACGCCTCAT CTGTGACAGC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 base pairs 

(B) TYPE: nucleic acid 
CC) STRANDEDNESS: single 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12: 
GTAGCTCGGG GGGTGGGGTG TGCACGCGGC CGCGGGATTG AATGAAGGCA AAGGAGGCAG 
AGCCTGAGCG CTCGCAAGGT TCGGGTCGGG AGGACTCGCG GGGCCAGAGC AGCGGGATGG 
GTGAACGTGC AGGTCCAAAC CACCCCTTCT CCTGCAAGTC CCGGCCTCAC ACTCAGCCCA 
GCTCTCTCTC CTAGAATGTC TTCTGCTGCT TCTGCTGCCC TCCTTGCTGC TGCTTCCTCT 
GGGCCTCCCA GTCCTGGGCG CCCCCACACA CCTCATCTGT GACAGC 

<2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCGTAGCTGT TGGGGTGGGG TGCGCACGAA GCCGCGGGAT TGAATGAACG CGGAGGCAGA 
GCTGAGCGCT CCCAAGGTCC GGGTGGGGAG GGCTCACTGG GGGCTGCGCA GCGGGATGGG 
TGAACCTGCC GGTCCAAAGC AGCCCTTCTC CTGCGAGAAT GCACCTCATC CGCAGCCCGT 
CTCTCTTTCC TAGACTGTAC TCCGCTGCTG CTGCTGCTGC TGTCTCTTCT GCTGTTTCCT 
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CTGGGCCTCC CAGTCTTGGG CGCCCCCCCA CGCCTCATCT GTGACAGC 

(2) INFORMATION FOR EEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 base pairs 
<B) TYPE: nucleic acid 
{C) STRANDEDNESS : single 
(D) TOPOLOGY: both 



<xi) SEQUENCE DESCRIPTIOK: SEQ ID N0:14: 

TGGGGGTGGG GTGTGCAGGC GGCAGCAGGA TTGAATGAAG GCAGAGGAGA CAGACCCTGA 60 

GCGCTGGGAA GGTTGGGGGC AGGAGCCACT AGCTGGGGGC AGAGGAGGGG GATGCGTGAA 120 

CCTGCCCCTC CAAACCAACC CTCCCCTTCC CCTCCCTGCC TCACACTCAG CCCGGATCTC 18 0 

CTTTCCAGAA TGTCCTGCCC TGCTGCTTCT GCTGTCCCTG CTACTGCCTC CTCTGGGCCT 24 0 

CCCAGCCCTG GGCGCCCCTC CACGCCTCAT CTGTGACACC 28 0 
(2) INFORMATION FOR SEO ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 baee paire 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

<xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 

CAGGTCAGTT TGATTGGGTG GATGTGTGCC GCGAAGCAGC GGCCGGATTG AATGAAGGCA 6 0 

GGGGAGGCAG AACCTGAACG CTGGGAAGGT GGGGGTCGGG CGCGACTAGT TGGGGGCAGA 120 

GGAGCGGGAT GTGTGAACCT GCCCCTCCAA ACCCACACAG TCAGCCTGGC ACTCTTTTCC 18 0 

AGAATGTCCT GCCCTGTTCC TTTTGCTGTC TTTGCTGCTG CTTCCTCTGG ACCTCCCAGT 24 0 

CCTGGGCGCC CCCCCTCGCC TCATTTGTGA CAGC 274 

(2) INFORMATION FOR SEQ ID NO: 16: 

<i) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TGGTAGTTTG GCGGGTGGTG TGTGCTCACC GCGGCGGCGG CGGGGATTGA ATGAAGGCCA 6 0 

AGGAGGCAGA ACCCGAGCGC TGGGAAGGTT CGGGGTGGGA GCGACAAGCT GGGGGCAGAG 12 0 

GAACGGGATG TGTGAACCTG CTCCTCCAAA CACACTCAGC CTGGCACTCT TTTCTAGAAT 18 0 

GTCCTGCCCT GCTGCTTCTG CTATCTTTGC TGCTGCTTCC CCTGGGCCTC CCAGTCCTGG 24 0 

GCGCCCCCCC TCGCCTCATC TGTGACAGC 269 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 base pairt 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(xi) SEaUENCE DESCRIPTION: SEQ ID NO: 17: 
GGTAGTTTGG CGGGTGGGGT GTGCTCACCG CGGCGGCGGC GGGGATTGAA TGAAGGCCAA 
GGAGGCAGAA CCCGAGCGCT GGGAAGGTTC GGGGTGGGAG CGACAAGCTG GGGGCAGAGG 
AACGGGATGT GTGAACCTGC TGCTCCAAAC ACACTCAGCC TGGCACTCTT TTCTAGAATG 
TCCTGCCCTG CTGCTTCTGC TATCTTTGCT GCTGCTTCCC CTGGGCCTCC CAGTCCTGGG 
CGCCCCCCCT CGCCTCATCT GTGACAGC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CGGTCGCTTG GGGGGTGGGG TGTGCAGCGC GGAGGGATTG AATGAAGGCC ACTCAGGCAG 
AGCCTAAGCA ATTGCAAGGT CGGGGTCAGC AGAGACTATA AGGGCAGAGG GGTCTCGCTG 
AGCCAACCGG CCCCTGCAGT CCCACGGCCC CCTCCCCTCT CGGCCTCACA CTCAGCCTGC 
CTTTCTTCCA GAACGTCCCA CCCTGCTGCT TTTACTCTCC TTGCTACTGA TTCCTCTGGG 
CCTCCCAGTC CTCTGTGCTC CCCCACGCCT CATCTGCGAC AGT 
(2) INFORMATION FOR SEQ ID NO: 19: 

( i ) ■ SEQUENCE CHARACTER! ST I CS : 

(A) LENGTH: 283 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
ATGGTGGCTT GGGGGGTGGT GTGTGCAGCG CCGGGAGATA GAATGAAGGC CACTCAGGCA 
GAGCCTAAGC AGTTGCAAGG TCGGGGTCAG CAGAGACTGG AAGGGCAGAG GAGCCTCGCT 
GTGCAAACCG GTCTCTGGTC GTCCCACGGC CCTCCCCTCC CAGCCTCACA CCCAGCCTGC 
CTTTCTTCTA GAACGTCCAA CCCTGCTGCT TTTGCTGTCT TTGCTCCTGC TTCCCCTGGG 
CCTCCCAGTC CTCTGCGCTC CCCCACGCCT CATCTGCGAC AGC 
(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH; 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Bingle 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GGGTGGTGGC GATGAATGAA GCAGCAGCTA GGTGGGGGGC GGTGAAGTAC AGCCGTCTCA 6 0 

GAGTGTCTTC TCTCTGCCCT GGCCTCCCAG CTGGCCCCCC CTCATTGGAC AG 112 
(2) INFORMATION FOR SEQ ID NO; 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; both 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 21: 

AGGCGCGGAG ATGGGGGTGC ACGAATGTCC TGCCTGGCTG TGGCTTCTCC TGTCCCrrGCT 6 0 

GTCGCTCCCT CTGGGCCTCC CAGTCCTGGG CGCCCCACCA CGCCTCATCT GTGACAGCCG 120 

AGTCCTGGAG AGGTACCTCT TGGAGGCCAA GGAGGCCGAG AATATCACGA CGGGCTGTGC 18 0 

TGAACACTGC AGCTTGAATG AGAATATCAC TGTCCCAGAC ACCAAAGTTA ATTTCTATGC 24 0 

CTGGAAGAGG ATGGAGGTCG GGCAGCAGGC CGTAGAAGTC TGGCAGGGCC TGGCCCTGCrr 3 00 

GTCGGAAGCT GTCCTGCGGG GCCAGGCCCT GTTGGTCAAC TCTTCCCAGC CGTGGGAGCC 3 60 

CCTGCAGCTG CATGTGGATA AAGCCGTCAG TGGCCTTCGC AGCCTCACCA CTCTGCTTCG 42 0 

GGCTCTGGGA GCCCAGAAAG AAGCCATCTC CCCTCCAGAT GCGGCCTCAG CTTGCTCCACT 46 0 

CCGAACAATC ACTGCTGACA CTTTCCGCAA ACTCTTCCGA GTCTACTCCA ATTTCCTCCG 54 0 

GGGAAAG(rrG AAGCTGTACA CAGGGGAGGC CTGCAGGACA GGGGACAGAT GACCAGGTGT 600 

GTCCACCTGG GCATATCCAC CACCTCCCTC ACCAACATTG CTTGTGCCAC ACCCTCCCCC 66 0 

GGCCACTCCT GAACCCCGTC GAGGGGCTCT CAGCTCAGCG CCAGCCTGTC CCATGGACAC 72 0 

TCCAG 725 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ACGAATGTCC TGCCTGGCTG TGGCTTCTCC TGTCTCTCGT GTCGCTCCCT CTGGGCCTCC €0 

CAGTCCCGGG CGCCCCACCA CGCCTCGTCT GTGACAGCCG AGTCCTGGAG AGGTACCTCT 12 0 

TGGAGGCCAA GGAGGCCGAG AATGTCACGA TGGGCTGTTC CGAAAGCTGC AGCTTGAATG 18 0 



wo 94/24 160 



PCTAJS94/04361 



- 78 - 

AGAATATCAC CGTCCCAGAC ACCAAAGTTA ACTTCTATGC CTGGAAGAGG ATAGAGGTCG 24 0 

GGCAGCAGGC TGTAGAAGTC TGGCAGGGCC TGGCCCTGCT CTCAGAAGCT GTCCTGCGGG 3 00 

GCCAGGCCGT GTTGGCCAAC TCTTCCCAGC CTTTCGAGCC CCTGCAGCTG CACATGGATA 360 

AAGCCATCAG TGGCCTTCGC AGCATCACCA CTCTGCTTCG GGCGCTGGGA GCCCAGGAAG 42 0 

CCATCTCCCT CCCAGATGCG GCCTCGGCTG CTCCACTCCG AACCATCACT GCTGACACTT 4 80 

TCTGCAAACT CTTCCGAGTC TACTCCAATT TCCTCCGGGG AAAGCTGAAG CTGTACACGG 54 0 

GGGAGGCCTG CAGGAGAGGG GACAGATGAC CAGGTGCGTC CAGCTGGGCA CATCCACCAA SOO 

CTCCCTCACC AACACTGCCT GTGCCACACC CTCCCTCACC ACTCCCGAAC CCCATCGAGG 660 

GGCTCTCAGC TAAGCGCCAG C 681 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 679 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CCGAACGTCC CACCCTGCTG CTTTTACTCT CCTTGCTACT GATTCCTCTG GGCCTCCCAG 60 

TCCTCTGTGC TCCCCCACGC CTCATCTGCG ACAGTCGAGT TCTGGAGAGG TACATCTTAG 120 

AGGCCAAGGA GGCAGAAAAT GTCACGATGG GTTGTGCAGA AGGTCCCAGA CTGAGTGAAA 18 0 

ATATTACAGT CCCAGATACC AAAGTCAACT TCTATGCTTG GAAAACyiATG GAGGTGC^AG 24 0 

AACAGGCCAT AGAAGTTTGG CAAGGCCTGT CCCTGCTCTC AGAAGCCATC CTGCAGGCCC 3 00 

AGGCCCTGCT AGCCAATTCC TCCCAGCCAC CAGAGACCCT TCAGCTTCAT ATAGACAAAG 36 0 

CCATCAGTGG TCTACGTAGC CTCACTTCAC TGCTTCGGGT ACTGGGAGCT CAGAAGGAAT 420 

TGATGTCGCC TCCAGATACC ACCCCACCTG CTCCACTCCG AACACTCACA GTGGATACTT 48 0 

TCTGCAAGCT CTTCCGGGTC TACGCCAACT TCCTCCGGGG GAAACTGAAG CTGTACACGG 54 0 

GAGAGGTCTG CAGGAGAGGG GACAGGTGAC ATGCTGCTGC CACCGTGGTG GACCGACGAA 6 00 

CTTGCTCCCC GTCACTGTGT CATGCCAACC CTCCACCACT CCCAACCCTC ATCAAACGGG 66 0 

TCATTACCTT CTTACCAGT 679 
(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 678 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CCGAACGTCC CACCCTGCTG CTTTTACTAT CCTTGCTACT GATTCCTCTG GGCCTCCCAG 6 0 

TCCTCTGCGC TCCCCCACGC CTCATTTGCG ACAGTCGCGT TCTGGAGAGG TACATCTTGG 12 0 
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AGGCCAAGGA GGCAGAAAAT GTCACAATGG GCTGTGCAGA AGGTCCCAGA CTGAGTGAGA 18 0 

ATATTACCGT CCCAGATACC AAAGTCAACT TCTACGCTTG GAAAAGAATG AAGGTGOAAG 24 0 

AACAGGCTGT AGAAGTTTGG CAAGGCCTGT CTCTGCTCTC AGAAGCCATC CTGCAGGCCC 3 00 

AGGCTCTGCA GGCCAATTCC TCCCAGCCAC CAGAGAGTCT TCAGCTTCAT ATACSACAAAG 3 60 

CCATCAGTGG GCTACGTAGC CTCACTTCAC TGCTTCGGGT GCTGGGAGCT CAGAAGGAAT 42 0 

TGATGTCGCC TCCAGACGCC ACCCAAGCCG CTCCACTCCG AACACTCACA GCGGATACTT 48 0 

TCTGCAAGCT CTTCCGGGTC TACTCCAACT TCCTCCGGGG GAAACTGAAG CTGTACACGG 54 0 

GGGAGGCCTG CAGGAGAGGG GACAG<3TGAC CTGCCACTGC CGTGTACCCG CCAACTCGCT 6 00 

CACCGTCACT GTGTCACGCC AACCCTCCAC CACTCCCAAC CCTCATCAAA CGGGGTTGTT 66 0 

TGTTACCTTC TTACCGGC 678 
(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: €87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:25: 

GCGACTGTAC TCCGCTGCTG CTGCTGCTGC TGTCTCTTCT GCTGTTTCCT CTGGGCCTCC 6 0 

CAGTCTTGGG CGCCCCCCCA CGCCTCATCT GTGACAGCCG AGTCCTGGAG AGGTACATCC 12 0 

TGGAGGCCAG GGAGGCCGAA AATGCCACGA TGGGCTGTGC AGAAGGCTGC AGCTTCAGTG 180 

AGAATATCAC TGTCCCAGAC ACCAAGGTTA ACTTCTATGC CTGGAAGAGG ATGGAGGTCC 24 0 

AGCAGCAGGC TCTGGAAGTC TGGCAGGGCC TGGCTCTGCT CTCAGAAGCT ATCTTTCGGG 3 00 

GCCAGGCCCT ACCGGCCAAC GCATCCCAGC CATGCGAGGC CCTGCGGTTG CACGTGGACA 360 

AAGCTGTCAG CGGCCTCCGC AGTCTCACCT CCCTGCTTCG GGCGCTGGGA GCCCAGAAAG 420 

AAGCCATCCC- CCTTCCAGAT GCAACCCCCT CCGCAGCCCC ACTCCGAATA TTCACTGTTG 480 

ATGCTTTGTC CAAGCTCTTC CGAATCTACT CCAATTTCCT GAGGGGAAAG CTGACGCTGT 54 0 

ACACAGGGGA GGCCTGCAGG AGAGGGGACA GGTGACCCAG TGCTATCACC CCGGGCACGT 6 00 

CCATCACCTC GCTCACCATC ACTGCCTACG CCATGCCTTC CACGCCGCCA CTCCCAACCC 66 0 

CTGTCGACGA CGGGCCACCA GCTCAGT 687 
(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 688 base pairs 

(B) TYPE:_nucleic acid 
(l) & ikAKu^DNESS : single 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2€ : 
GAATGTCCTG CCCGGCTGCT TCTGCTATCT TTGCTGCTGC TTCCTCTGGG CCTCCCAGTC 6 0 
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CTGGGCGCCC CCCCACGCCT CATCTGTGAC AGCCGAGTCC TGGAGAGGTA CATCTTGGAG 12 0 

GCCAAGGAGG GCGAAAATGC CACGATGGGC TGTGCCGAAA GCTGCAGCTT CAGTGAGAAT ISO 

ATCACTGTCC CAGACACCAA GGTTAACTTC TATGCCTGGA AGAGGATGGA GGTCCAGCAG 24 0 
CAGGCCATGG AGGTCTGGCA GGGTCTGGCC CTGCTCTCAG AAGCCATCCT GCAGGGCCAG 
GCCCTGTTGG CCAACTCCTC CCAGCCATCT GAGGCCCTGC AGCTGCATGT GGACAAAGCT 

GTCAGCGGCC TGCGCAGCCT CACCTCCCTG CTTCGGGCGC TGGGAGCCCA GAAGGAAGCC 420 

ATCCCCCTTC CAGACGCATC CCCCTCCTCT GCCACCCCAC TCCGAACATT TGCTGTTGAT 48 0 

ACTTTGTGCA AACTTTTCCG CAACTACTCC AATTTCCTGC GGGGAAAGCT GACGCTGTAC 54 0 

ACTGGAGAGG CCTGCAGGAG AAGGGACAGG TGACTTGGTG CTCCCACCCG GGGCATGTCC 600 

ACCACTTCGC TCACCACCAC TGCCTGTGCC ATGCCTTCTG CACCTCCACT CCCAACCCCC 660 
GCTGAGGGGC CATCAGCTCA GCGCCAGC 
(2) INFORMATION FOR SEQ ID NO -.27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) T0P0U5Gy: both 



300 
360 



(xi) SEQUEKCE DESCRIPTION: SEQ ID NO:2'7: 



GAATGTCCTG 


CCCTGCTGCT 


TCTGCTATCT TTGCTGCTGC TTCCCCTGGG 


CCTCCCAGTC 


60 


CTGGGCGCCC 


CCCCTCGCCT 


CATCTGTGAC 


AGCCGAGTCC 


TGGAGAGGTA 


CATTCTGGAG 


120 


GCCAGGGAGG 


CCGAAAATGT 


CACGATGGGC 


TGTGCTGAAG 


GCTGCAGCTT 


CAGTGAGAAT 


180 


ATCACTGTCC 


CAGACACCAA 


GGTCAACTTC 


TATACCTGGA 


AGAGGATGCA 


CGTCGGGCAG 


240 


CAGGCTGTGG AAGTCTGGCA 


GGGCCTCGCC 


CTGCTCTCAG 


AAGCCATCCT 


GCGGGGCCAG 


300 


GCCCTGCTGG 


CCAACTCCTC 


CCAGCCATCT 


GAGACCCTGC 


AGCTGCATGT 


GGATAAAGCC 


360 


GTCAGCAGCC 


TGCGCAGCCT 


CACCTCCCTG 


CTTCGGGCAC 


TGGGAGCCCA 


GAAGGAAGCC 


420 


ACCTCCCTTC 


CAGAGGCAAC 


CTCTGCTGCT 


CCACTCCGAA 


CATTCACTGT 


CGATACTTTG 


480 


TGCAAACTTT 


TCCGAATCTA 


CTCCAACTTC 


CTGCGGGGAA 


AGCTGACGCT 


GTACACAGGG 


54 0 


GAGGCCTGCC 


GAAGAGGAGA 


CAGGTGACCA 


GGTGCTCCTA 


CCCCGGGCAT 


GTCCACCACC 


600 


TCACTTACCA 


CCAATGCCTG 


TGCCACGCCC 


TCTGCACCAC 


CACTCCTGAC 


CCCTGTCGGG 


660 


GGTGATCAGC 


TCAGCACCAG 


C 








681 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28 
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Met Gly Val Hj.6 Glu Cye Pro Ala Trp Leu Trp Leu Leu Leu Ser Leu 

15 10 15 

Leu Ser Leu Pro Leu Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 30 ainino acids 

(B) TYPE: amino acid 
{D> TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 29: 

Met Gly Val His Glu Cys Pro Ala Trp Leu Trp Leu Leu Leu Ser Leu 
IS 10 15 

Val Ser L€U Pro Leu Gly Leu Pro Val Pro Gly Ala Pro Pro 
20 25 30 

(2) INFORMATION FOR SEQ ID NO; 30: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 29 amino acide 
(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 30: 

Met Gly Val Pro Glu Arg Pro Thr Leu Leu Leu Leu Leu Ser Leu Leu 
15 10 15 

Leu He Pro Leu Gly Leu Pro Val Leu Cys Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 31: 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 30^ amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi)- SEQUENCE DESCRIPTION: SEQ ID N0:31: 

Met Gly Val Arg Asp Cye Thr Pro Leu Leu Leu Leu Leu Leu Ser Leu 
15 10 IS 

Leu Leu Phe Pro Leu Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 30 

(2) INFORMATION FOR SEQ ID NO; 32: 

<i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 25 amino acids 

(D) TOPOLOGY t both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Glu Cys Pro Ala Arg Leu Leu Leu Leu Ser Leu Leu Leu Leu Pro Leu 
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15 10 15 

Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEOUENCE DESCRIPTION: SEQ ID NO:33: 

Glu eye Pro Ala Leu Leu Leu Leu Leu Ser Leu Leu Leu Pro Pro Leu 
IS 10 15 

Gly Leu Pro Ala Leu Gly Ala Pro Pro 
20 25 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEOUENCE DESCRIPTION: SEQ ID NO: 34: 

Glu Cy6 Pro Ala Leu Leu Leu Leu Leu Ser Leu Leu Leu Leu Pro Leu 
15 10 15 

Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
• (D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Glu Cys Pro Ala Leu Phe Leu Leu Leu Ser Leu Leu Leu Leu Pro Leu 

15 10 15 

Agp Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEOUENCE DESCRIPTION: SEQ ID NO: 36; 
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Glu Cye Leu Leu Leu Leu Leu Leu Pro Ser Leu Leu Leu Leu Pro Leu 

15 10 15 

Gly Leu Pro Val Leu Gly Ala Pro Pro 

20 25 

(2) INFORMATION FOR SEO ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

Ala Pro Pro Arg Leu lie Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
15 10 15 

Leu Glu Ala Lys Glu Ala Glu Aen He Thr Thr Gly Cys Ala Glu His 
20 25 30 

Cya Ser Leu Asn Glu Aen He Thr Val Pro Aep Thr Lye Val Aan Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu 
65 70 75 80 

Leu Val Asn Ser Ser Gin Pro Trp Glu Pro Leu Gin Leu His Val Asp 
85 90 95 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Lys Glu Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala 
115 120 125 

Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val 
130 135 140 

Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
145 150 155 160 

Cye Arg Thr Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

1 " s ' " lo" "15* 

Leu Glu Ala Lys Glu Ala Glu Asn Val Thr Met Gly Cys Ser Glu Ser 
20 25 30 

Cys Ser Leu Asn Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 
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Tyr Ala Trp Lys Arg Met Glu Val 
50 55 

Gin Gly Leu Ala Leu Leu Ser Glu 
65 70 

Leu Ala Aan Ser Ser Gin Pro Phe 
85 

Lye Ala lie Ser Gly Leu Arg Ser 
100 

Gly Ala Gin Glu Ala He Ser Leu 
115 120 

Leu Arg Thr He Thr Ala Asp Thr 
130 135 

Ser Aen Phe Leu Arg Gly Lye Leu 
145 150 

Arg Arg Gly Asp Arg 
165 
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Gly Gin Gin Ala Val Glu Val Trp 
£0 

Ala Val Leu Arg Gly Gin Ala Val 
75 80 

Glu Pro Leu Gin Leu His Met Asp 
90 95 

He Thr Thr Leu Leu Arg Ala Leu 
105 110 

Pro Aep Ala Ala Ser Ala Ala Pro 
125 

Phe Cya Lye Leu Phe Arg Val Tyr 
140 

Lye Leu Tyr Thr Gly Glu Ala Cys 

155 160 



(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

Ala Pro Pro Arg Leu Val Cys Aap Ser Arg Val Leu Glu Arg Tyr Leu 
15 10 15 

Leu Glu Ala Lye Glu Ala Glu Asn Val Thr Met Gly Cys Ser Glu Ser 
20 25 30 

Cye Ser Leu Aen Glu Aen He Thr Val Pro Asp Thr Lye Val Asn Phe 
35 40 45 

Tyr Ala Trp Lys Arg He Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 

Gin <3ly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Val 
65 70 75 80 

Leu Ala Aen Ser Ser Gin Pro Phe Glu Pro Leu Gin Leu His Met Asp 
85 90 95 

Lys Ala He Ser Gly Leu Arg Ser He Thr Thr Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Glu Ala He Ser Leu Pro Asp Ala Ala Ser Ala Ala Pro 

115 120 125 

Leu Ara Thr He Thr Ala Asp Thr Phe Cvs Lvs Leu Phe Arg Val Tyr 
130 135 140 

Ser Aen Phe Leu Arg Gly Lye Leu Lys Leu Tyr Thr Gly Glu Ala Cys 
145 150 155 160 

Arg Arg Gly Asp Arg 
165 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



Ixi) SEQUENCE DESCRIPTION: SEQ ID NO:40 



Ala Pro Pro Arg Leu lie Cys Asp Ser Arg Val Leu Glu Arg Tyr He 

15 10 IS 

Leu Glu Ala Lye Glu Ala Glu Asn Val Thr Met Gly Cye Ala Glu Gly 
20 25 30 

Pro Arg Leu Ser Glu Asn He Thr Val Pro Aap Thr Lye Val Asn Phe 
35 40 45 

Tyr Ala Trp Lye Arg Met Glu Val Glu Glu Gin Ala lie Glu Val Trp 

50 55 60 

Gin Gly Leu Ser Leu Leu Ser Glu Ala He Leu Gin Ala Gin Ala Leu 
65 70 75 80 

Leu Ala Asn Ser Ser Gin Pro Pro Glu Thr Leu Gin Leu Hia He Asp 

85 90 95 

Lys Ala He Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Val Leu 
100 105 110 

Gly Ala Gin Lye Glu Leu Met Ser Pro Pro Asp Thr Thr Pro Pro Ala 



Cys Arg Arg Gly Asp Arg 



(2) INFORMATION FOR SEO ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 16 6 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 

Ala Pro Pro Arg Leu He Cye Aep Ser Arg Val Leu Glu Arg Tyr He 

15 10 15 

Leu Glu Ala Lys Glu Ala Glu Asn Val Thr Met Gly Cye Ala Glu Gly 

20 25 30 

Pro Arg Leu Ser Glu Asn He Thr Val Pro Asp Thr Lys Val Aen Phe 

35 4C 45 

Tyr Ala Trp Lys Arg Met Lys Val Glu Glu Gin Ala Val Glu Val Trp 

50 55 60 

Gin Gly Leu Ser Leu Leu Ser Glu Ala lie Leu Gin Ala Gin Ala Leu 
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65 70 75 80 

Gin Ala Asn Ser Ser Gin Pro Pro Glu Ser Leu Gin Leu Hie lie Asp 
85 90 95 

Lys Ala He Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Val Leu 
ICC 105 110 

Gly Ala Gin Lys Glu Leu Met Ser Pro Pro Asp Ala Thr Gin Ala Ala 
115 120 125 

Pro Leu Arg Thr Leu Thr Ala Asp Thr Phe Cys Lys Leu Phe Arg Val 
130 135 140 

Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
145 150 155 160 

Cys Arg Arg Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Ala Pro Pro Arg Leu lie Cys Asp Ser Arg Val Leu Glu Arg Tyr lie 

15 10 15 

Leu Glu Ala Arg Glu Ala Glu Asn Ala Thr Met Gly Cys Ala Glu Gly 
20 25 30 

Cys Ser Phe Ser Glu Asn He Thr Val Pro Asp Thr Lys Val Aen Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Gin Gin Gin Ala Leu Glu Val Trp 
SO 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala He Phe Arg Gly Gin Ala Leu 
65 70 75 80 

Pro Ala Aen Ala Ser Gin Pro Cys Glu Ala Leu Arg Leu His Val Asp 
85 90 95 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Lys Glu Ala He Pro Leu Pro Asp Ala Thr Pro Ser Ala 
115 120 125 

Ala Pro Leu Arg lie Phe Thr Val Asp Ala Leu Ser Lys Leu Phe Arg 
130 135 140 

He Tyr Ser Asn Phe Leu Arg Gly Lys Leu Thr Leu Tyr Thr Gly Glu 
145 150 155 160 



(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 
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(D! TOPOLOGY: both 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Ala Pro Pro Arg Leu lie CyB Aep Ser Arg Val Leu Glu Arg Tyr lie 

15 10 15 

Leu Glu Ala LyB Glu Gly Glu Asn Ala Thr Met Gly Cye Ala Glu Ser 
20 25 30 

Cys Ser Phe Ser Glu Asn lie Thr Val Pro Aep Thr Lye Val Aon Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Gin Gin Gin Ala Met Glu Val Trp 
50 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala lie Leu Gin Gly Gin Ala Leu 
65 70 75 80 

Leu Ala Asn Ser Ser Gin Pro Ser Glu Ala Leu Gin Leu His Val Asp 
85 90 95 

LyB Ala Val Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Ala Leu 



Arg Asn Tyr Ser Asn Phe Leu Arg Gly Lys Leu Thr Leu Tyr Thr Gly 
145 150 155 160 

Glu Ala Cys Arg Arg Arg Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(xi) ■ SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr He 
15 10 15 

Leu Glu Ala Arg Glu Ala Glu Asn Val Thr Met Gly Cye Ala Glu Gly 

20 25 30 

Cys Ser Phe Ser Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tvr Thr Trp Lvs Arq Met Aep Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala He Leu Arg Gly Gin Ala Leu 
65 70 75 80 

Leu Ala Asn Ser Ser Gin Pro Ser Glu Thr Leu Gin Leu His Val Asp 
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LvB Ala Val Ser Ser Leu Arg Ser Leu Thr Ser Leu Leu Arg Ala Leu 
100 110 

Glv Ala Gin Lys Glu Ala Thr Ser Leu Pro Glu Ala Thr Ser Ala Ala 
115 120 125 

Pro Leu Arg Thr Phe Thr Val Aep Thr Leu Cys Lye Leu Phe Arg lie 
130 136 140 

Tyr Ser Aen Phe Leu Arg Gly Lys Leu Thr Leu Tyr Thr Gly Glu Ala 

ten IbO 

eye 

(2) INFORMATION FOR SEQ ID NO: 45; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

Ala Pro Pro Arg Leu lie Cys Aep Ser Arg Val Leu Glu Arg Tyr Leu 
15 10 15 

Leu Glu Ala Lys Glu Ala Glu Asn lie Thr Thr Gly Cye Ala Glu Hie 
20 25 30 

Cys Ser Leu Asn Glu Asn He Thr Val Pro Asp Thr Lye Val Asn Phe 
35 40 45 

Tyr Ala Trp LyB Arg Met Glu Gly Val Gin Gin Ala Val Glu Val Trp 
SO 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu 
65 70 75 80 

Leu Val Aen Ser Ser Gin Pro Trp Glu Pro Leu Gin Leu His Val Asp 
85 90 95 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Lye Glu Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala 
115 120 125 

Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val 
130 135 140 

Tvr Ser Asn Phe Leu Arg Gly Lys Leu LyE Leu Tyr Thr Gly Glu Ala 
145 150 155 160 

Cys Arg Thr Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Ser Arg Val Leu Glu Arg Tyr Leu Leu Glu Xaa Xaa Ala Lye 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Gin Ala Val Glu Val Trp Gin Gly Leu Ala Leu Leu Ser Glu Ala Val 
15 10 15 



(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Pro Leu Gin Leu Hie Val Asp Lye Ala Val Ser Gly Leu Arg Ser Leu 



(2) INFORMATION FOR SEQ ID NO:49: 



SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 amino acids 

•(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

Thr He Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val Tyr Ser Asn 



Phe Leu Arg Gly Lys 
20 

(2) INFORMATION FOR SEQ ID I 



(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 
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Glu Asn He Thr Thr Gly CyE Ala Glu His Cye Ser Leu Aen Glu Aen 

lie Thr Val Pro Asp Thr Lya Val Aen Phe Tyr Ala Trp Lys Arg Aer. 
20 25 30 

Glu Val Gly Gin 
35 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:51: 

Leu Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Lye Glu Ala He Ser 
15 ID 15 

Pro Pro Asp Ala Ala Ser Ala Ala Pro Leu Arg 

20 25 

(2) INFORMATION FOR SEQ ID NO: 52: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Leu Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Lye Leu lie Ser Glu 
15 10 15 

Glu Asp Leu Glu Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala Pro 
20 25 30 

Leu Arg 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i)- SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 53 : 
Cye Arg Thr Gly Asp Arg 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 54: 

Lys Asp Glu Leu 
1 

INFORMATION FOR SEQ ID NO: 55: 

(i! SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
His His His His Hie His 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

Met Gly His Ser Ser Gly His He Glu Gly Arg Hie Met Ala Pro Pro 



(2) INFORMATION FOR SEQ ID NOrB'?: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 12 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY; both 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO:57: 

Val 'Leu Glu Arg Tyr Leu Leu Glu Ala Lye Glu Ala 

15 10 

(2) INFORMATION FOR SEQ ID NO: 58; 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO; 58: 
. — ^1 — Ti» 711, i,rr^ tv> ^ Vn n>-n l.vc I.Pii PHp Afo Val Tvr Ser 

V " V" " ^ ' ' 10 ' 

Asn Phe Leu Arg 

20 

(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: i: amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Gly Lyg Leu Lys Leu Tyr Thr Gly Glu Ala Cys 
1 5 10 
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Claims: 



1 1. An erythropoieiinmutein comprising native erythropoietin with 

2 one or more of the following modifications: 

3 (a) replacement of the amino acid at position 20 with a first 

4 substitute amino acid; 

5 (b) replacement of the amino acid at position 49 with a 

6 second substitute amino acid; 

7 (c) replacement of the amino acid at position 73 with a 

8 third substitute amino acid; 

9 (d) replacement of the amino acid at position 140 with a 

10 fourth substitute amino acid; 

1 1 (e) replacement of the amino acid at position 143 with a 

12 fifth substitute amino acid; and 

13 (f) replacement of the amino acid at position 146 with a 

14 sixth substitute amino acid; 

15 (g) replacement of the amino acid at position 147 with a 

16 seventh substitute amino acid; 

17 (h) replacement of the amino acid at position 154 with a 

18 eighth substitute amino acid; 

19 wherein said erythropoietin mutein exhibits enhanced biological activity 

20 compared to said native erythropoietin. 

1 2. The erythropoietin mutein of claim 1 comprising native 

2 erythropoietin with one of said modifications. 

1 3. Tlie erythropoietin mutein of claim 1 wherein said first, second, 

3 from the group consisting of alanine, serine, threonine, and glycine. 
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1 4. The erythropoietin mutein of claim 3 wherein said first 

2 substitute amino acid is serine, and/or said second, third, fourth, fifth, sixth, 

3 seventh and/or eighth substitute amino acids are alanine. 

5 . The erythropoietin mutein of claim 2 wherein the amino acid at 
position 20 is replaced with the amino acid alanine. 

1 6. The erythropoietin mutein of claim 2 wherein the amino acid at 

2 position 49 is replaced with the amino acid serine. 

3 7 . The erythropoietin mutein of claim 2 wherein the amino acid at 

4 position 73 is replaced with the amino acid glycine. 

5 8. The erythropoietin mutein of claim 2 wherein the amino acid at 

6 position 140 is replaced with the amino acid alanine. 

7 9. The erythropoietin mutein of claim 2 wherein the amino acid at 

8 position 143 is replaced with the amino acid alanine. 

9 10. The erythropoietin mutein of claim 2 wherein the amino acid at 

10 position 146 is replaced with the amino acid alanine. 

11. The erythropoietin mutein of claim 2 wherein the amino acid at 
position 147 is replaced with the amino acid alanine. 

1 12. The erythropoietin mutein of claim 2 wherein die amino acid at 

2 position 154 is replaced with the amino acid alanine. 

1 13. A DNA molecule encoding the erythropoietin mutein of 

2 claim 1. 
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1 14. A recombinant DNA construct comprising the DNA molecule 

2 of claim 13. 

1 15. A vector comprising the DNA construct of claim 14 which is 

2 capable of expressing said erythropoietin mutein in a host cell. 



1 16. A host cell comprising the DNA molecule of claim 13 capable 

2 of expressing the erythropoietin mutein encoded by said DNA molecule. 

1 17. The host cell of claim 16 wherein said host cell is capable of 

2 glycosylating said erythropoietin mutein. 

1 18. The host cell of claim 17 wherein said host cell is a mammalian 

2 cell. 

1 19. The host cell of claim 18 wherein said host cell is selected from 

2 the group consisting of Chinese Hamster Ovary cells, Cos7 cells,Cosl cells, 

3 baby hamster kidney cells, and CVl cells. 

1 20. A method of stimulating erythrocyte production in a subject, 

2 wherein said method comprises administering to said subject an efficacious 

3 amount of the erythropoietin mutein of claim 1. 

1 21. A method of stimulating erythrocyte production in a subject, 

2 said method comprising administering to said subject an efficacious amount of 

3 an erythropoietin mutein comprising native erythropoietin with the amino acid 

4 residue at position 143 replaced by an alanine residue, wherein said efficacious 
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1 22. A method of inducing the proliferation of a cell culture 

2 responsive to erythropoietin, wherein said method comprises contacting said 

3 cell culture with the erythropoietin mutein of claim 1. 

1 23. A method of inducing the proliferation of a cell culture 

2 responsive to erythropoietin, wherein said method comprises contacting said 

3 cell culture with an efficacious amount of an erythropoietin mutein comprising 

4 native erythropoietin with the amino acid residue at position 143 replaced by 

5 an alanine residue, wherein said efficacious amount is less than the 

6 corresponding amount of native erythropoietin. 

1 24. The method of claim 18 or 19 wherein said cell culture 

2 comprises cells selected from the group consisting of murine spleen cells, 

3 murine erythroleukemia cells, and human UT-7/Epo cells. 

1 25. A method for obtaining erythropoietin muteins with enhanced 

2 biological activity relative to native erythropoietin, said method comprising the 

3 steps: 

4 (a) preparing erythropoietin muteins comprising native erythropoietin 

5 with a single amino acid substitution, said substitution occurring at a position 

6 selected from the group consisting of positions 48-52, 151-160, and positions 

7 predicted to reside on the external surface of helices A and D; 

8 (b) assaying the biological activity of the erythropoietin muteins from 

9 step (a) over a range of dosages; and 

10 (c) identifying those erythropoietin muteins with enhanced biological 

1 1 activity relative to native erythropoietin. 
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